OUTLIER EIGENVALUE FLUCTUATIONS OF PERTURBED IID MATRICES

ANAND B. RAJAGOPALAN


Abstract. It is known that in various random matrix models, large perturbations
create outlier eigenvalues which lie, asymptotically, in the complement of
the support of the limiting spectral density. This paper is concerned with
fluctuations of these outlier eigenvalues of iid matrices $X_n$ under bounded rank and
bounded operator norm perturbations $A_n$, namely with
$\lambda(X_n/\sqrt{n} + A_n) - \lambda(A_n)$.
The perturbations we consider are allowed to be of arbitrary Jordan type and
have (left and right) eigenvectors satisfying a mild condition. We obtain the
joint convergence of the (normalized) asymptotic fluctuations of the outlier
eigenvalues in this setting with a unified approach.


1. Introduction 

1.1. Background. Following the works of [3] and [3] investigating the asymptotic
spectrum of perturbed empirical covariance matrices or spiked population models,
various efforts have been undertaken to better understand the outlier eigenvalues
of perturbed random matrix models. In the Hermitian setting, the works of [7],
[?] and [?] build up to an essentially complete picture of the asymptotic
locations and normalized fluctuations of the outlier eigenvalues of bounded
rank and bounded operator norm perturbations.

This paper obtains the asymptotic fluctuations of outlier eigenvalues for the iid
matrix ensemble under the same class of perturbations. Before stating our results,
we introduce the theorem on the asymptotic location of the outlier eigenvalues due
to [20] after presenting some introductory definitions and results.

Definition 1. An iid matrix $X$ is an infinite array of (complex) iid random variables
$(x_{ij})_{i,j \ge 1}$ which we identify with the sequence $(X_n)_{n \ge 1}$,
$X_n = (x_{ij})_{1 \le i,j \le n}$. We assume that the atom distribution $x = x_{1,1}$
satisfies the moment conditions $E x = 0$ and $E|x|^2 = 1$. We let $\Lambda(Y)$ denote
the spectrum of $Y$ and let

\[ \mu_{X_n} := \frac{1}{n} \sum_{\lambda \in \Lambda(X_n)} \delta_{\lambda/\sqrt{n}} \]

denote the empirical spectral distribution of $X$.

Theorem 1 (Circular law). For an iid matrix $X$, we have

\[ \mu_{X_n} \Rightarrow \mu_c := \frac{1}{\pi} 1_{\{z \in C : |z| < 1\}} \]

almost surely, where $\Rightarrow$ denotes weak convergence.


The circular law, which is the work of many authors (see [21] and references
therein), in particular implies that the spectral radius of $X/\sqrt{n}$, $\rho(X/\sqrt{n})$, satisfies
$\limsup \rho(X/\sqrt{n}) \ge 1$ almost surely. The following is a complementary result; see
[2] for a proof.

Theorem 2. Let $X_n$ be an iid matrix with atom distribution having bounded fourth
moment. Then

\[ \rho\Big(\frac{X_n}{\sqrt{n}}\Big) = \lim_{l \to \infty} \Big\| \Big(\frac{X_n}{\sqrt{n}}\Big)^l \Big\|^{1/l} \]

converges to 1 almost surely as $n \to \infty$. Moreover, for $l \ge 1$, $\|(X_n/\sqrt{n})^l\|$ converges to
$l + 1$ almost surely as $n \to \infty$.
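The statements of Theorems 1 and 2 can be checked numerically at moderate size. The following is a minimal sketch, assuming complex Gaussian entries (which satisfy $E x = 0$, $E|x|^2 = 1$); the function names and the choice $n = 400$ are illustrative assumptions, not part of the paper. It reports the spectral radius of $X/\sqrt{n}$, its operator norm (the case $l = 1$), and the fraction of eigenvalues in the disk of radius 1/2, which the circular law predicts to be close to 1/4.

```python
import numpy as np

def iid_complex_gaussian(n, rng):
    """n x n matrix of iid complex Gaussians with E x = 0, E|x|^2 = 1, E x^2 = 0."""
    return (rng.standard_normal((n, n)) + 1j * rng.standard_normal((n, n))) / np.sqrt(2)

def circular_law_summary(n=400, seed=0):
    """Summary statistics of X/sqrt(n) for one sample (illustrative helper)."""
    rng = np.random.default_rng(seed)
    M = iid_complex_gaussian(n, rng) / np.sqrt(n)
    eig = np.linalg.eigvals(M)
    return {
        # Theorem 2: the spectral radius should be close to 1.
        "spectral_radius": float(np.abs(eig).max()),
        # Case l = 1 of Theorem 2: the operator norm should be close to 2.
        "operator_norm": float(np.linalg.norm(M, 2)),
        # Circular law: the uniform law on the unit disk gives mass 1/4 to |z| < 1/2.
        "fraction_in_half_disk": float(np.mean(np.abs(eig) < 0.5)),
    }

if __name__ == "__main__":
    print(circular_law_summary())
```

At finite $n$ the three numbers only approximate their limits, so any comparison should allow a tolerance.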



Now let $A = A_n$ be a deterministic matrix of rank $O(1)$ and operator norm $O(1)$.
We will assume for notational convenience that $\Theta = \Theta_n := \{\lambda \in \Lambda(A_n) : |\lambda| > 1\}$
is independent of $n$ for $n$ sufficiently large and we let $m_\theta$ denote the multiplicity of
$\theta$. Then the following theorem (due to [20], with generalizations to other models in
[?], [?] and [?]) shows that outliers in the spectrum of $X_n/\sqrt{n} + A$ appear, in contrast
to the situation in Theorem 2.


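Theorem 3. Let $X$ be an iid matrix with bounded fourth moment and let $A$ and
$\Theta$ be as above. For each $\theta \in \Theta$ there exists

\[ \Lambda^\theta \subset \Lambda\Big(\frac{X_n}{\sqrt{n}} + A\Big) \]

with $|\Lambda^\theta| = m_\theta$ and for $\lambda \in \Lambda^\theta$,

\[ \lambda \to \theta \]

almost surely.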


To illustrate Theorem 3, in Figure 1 we have plotted the eigenvalues of a perturbed
Gaussian matrix $X/\sqrt{n} + A$, with $x$ having distribution $N(0,1)_C$ and $n = 1000$.
The two outliers near 2 correspond to the block $\begin{pmatrix} 2 & 0 \\ 0 & 2 \end{pmatrix}$ and the two outliers near
$1.5 + i$ are from the block $\begin{pmatrix} 1.5+i & 1 \\ 0 & 1.5+i \end{pmatrix}$ of $A$. Observe that the fluctuations from
the Jordan block are larger; this phenomenon will be discussed later.
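The experiment behind Figure 1 can be reproduced in a few lines. This is a minimal sketch under stated assumptions: complex Gaussian entries, a smaller size $n = 400$ to keep the run short, and an ad hoc threshold of 1.3 on the modulus to separate outliers from the bulk; the helper names are hypothetical.

```python
import numpy as np

def jordan_block(theta, k):
    """k x k upper Jordan block with eigenvalue theta."""
    return theta * np.eye(k, dtype=complex) + np.diag(np.ones(k - 1, dtype=complex), 1)

def perturbed_eigenvalues(n=400, seed=1):
    """Eigenvalues of X/sqrt(n) + A with A = 2*I_2 (+) J_{1.5+i,2} (+) 0_{n-4}."""
    rng = np.random.default_rng(seed)
    X = (rng.standard_normal((n, n)) + 1j * rng.standard_normal((n, n))) / np.sqrt(2)
    A = np.zeros((n, n), dtype=complex)
    A[0:2, 0:2] = 2.0 * np.eye(2)              # diagonalizable block with eigenvalue 2
    A[2:4, 2:4] = jordan_block(1.5 + 1j, 2)    # Jordan block with eigenvalue 1.5 + i
    return np.linalg.eigvals(X / np.sqrt(n) + A)

def outliers(eig, radius=1.3):
    """Eigenvalues outside the disk of the given radius (threshold is an assumption)."""
    return eig[np.abs(eig) > radius]

if __name__ == "__main__":
    out = outliers(perturbed_eigenvalues())
    print("distances to 2:      ", np.sort(np.abs(out - 2.0))[:2])
    print("distances to 1.5 + i:", np.sort(np.abs(out - (1.5 + 1j)))[:2])
```

In a typical run the two outliers near 2 sit at distance of order $n^{-1/2}$ from 2, while the two outliers from the Jordan block sit at the larger distance of order $n^{-1/4}$ from $1.5 + i$.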


1.2. Model and statement of results. The focus of our paper is the fluctuations
$\lambda - \theta$. More precisely, we obtain the limiting distribution of the normalized
fluctuations when $A$ is allowed to have arbitrary Jordan type and under certain
sparsity and uniformity assumptions on the (left and right) eigenvectors of $A$.
After introducing the main definition and theorem in this subsection, we will discuss
simpler special cases in Subsection 1.3.


We now define the perturbation matrices we will consider in this paper, along with 
associated notation. To unify notation in this paper, for any complex vector z, we 
let 

\[ z^{(d)} := \begin{cases} z & : d = 0 \\ \bar{z} & : d = 1 \end{cases} \tag{1} \]

where $\bar{z}$ denotes the (componentwise) conjugate of $z$. We will write $z^T$ for the
transpose of $z$ and $z^*$ for the conjugate transpose of $z$.

Figure 1. Eigenvalues of $X/\sqrt{n} + A$ with $X$ having iid $N(0,1)_C$
entries, $A = 2I_2 \oplus J_{1.5+i,2} \oplus 0_{996}$ and $n = 1000$. The smaller circles
are of radii $n^{-1/2}$ and $n^{-1/4}$.

Definition 2. A perturbation matrix $A = (A_n)_{n \ge 1}$ is a sequence of (complex)
$n \times n$ matrices with rank $O(1)$ and operator norm $O(1)$. For $\theta \in \Theta = \{\theta \in \Lambda(A_n) : |\theta| > 1\}$,
let $J_\theta$ be the Jordan block in the Jordan decomposition of $A$ corresponding
to $\theta$ with blocks written in nonincreasing order. We will assume that $\Theta$ and $(J_\theta)_{\theta \in \Theta}$
are independent of $n$ for $n$ sufficiently large. Let

\[ J_\theta = \bigoplus_{k=1}^{K_\theta} J_{\theta,k}^{\oplus m_{\theta,k}} \quad \text{where} \quad J_{\theta,k} := \begin{pmatrix} \theta & 1 & & \\ & \theta & \ddots & \\ & & \ddots & 1 \\ & & & \theta \end{pmatrix} \]

is the Jordan block of size $k$ occurring with multiplicity $m_{\theta,k}$ in $J_\theta$. To index the
eigenvectors and generalized eigenvectors, we introduce the following notation. Let

\[ I := \{ s = (i,j,k,\theta) : i \in [k],\ j \in [m_{\theta,k}],\ k \in [K_\theta],\ \theta \in \Theta \} \]

and for $s \in I$, we write $s = (i_s, j_s, k_s, \theta_s)$. Let

\[ I_\theta = \{ s \in I : \theta_s = \theta \}. \]

For fixed $j$, $k$ and $\theta$, let $(v_s)_{i_s=1}^{k}$ be the generalized eigenvectors corresponding to
the $j$th block of $J_{\theta,k}$, and let $v_{(1,j,k,\theta)}$ be the eigenvector for that block. Similarly
define $(u_s^*)_{i_s=1}^{k}$ to be the generalized left eigenvectors with the $u^*_{(k,j,k,\theta)}$'s being the left
eigenvectors. To index the left and right eigenvectors, we let

\[ I^u_\theta := \{ s \in I_\theta : i_s = k_s \} \]

and

\[ I^v_\theta := \{ t \in I_\theta : i_t = 1 \}. \]

Finally, we let

\[ I^2 := \{ (s,t,\theta) : \theta \in \Theta,\ s \in I^u_\theta,\ t \in I^v_\theta \} \]

and for $r \in I^2$, we write $r = (s_r, t_r, \theta_r)$.


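For $(s_i, t_i, \theta_i) \in I^2$, $i = 1, 2$, we assume that the limits of the following inner
products exist and define, for $d_1, d_2 \in \{0,1\}$, the scalars

\[ U^{(d_1),(d_2)}_{s_1,s_2} := \lim_{n \to \infty} (u_{s_1}^*)^{(d_1)} \big( (u_{s_2}^*)^{(d_2)} \big)^T, \tag{2} \]

\[ V^{(d_1),(d_2)}_{t_1,t_2} := \lim_{n \to \infty} \big( v_{t_1}^{(d_1)} \big)^T v_{t_2}^{(d_2)}. \tag{3} \]

We also assume the following convergence and define $(G_r)_{r \in I^2}$ by

\[ (u^*_{s_r} X v_{t_r})_{r \in I^2} \Rightarrow (G_r)_{r \in I^2}. \tag{4} \]

Lastly, we require the following technical assumption. Fix $\delta > 0$ and let

\[ L = \bigcup_{r \in I^2} \{ (i,j) \in N^2 : |u^*_{s_r,i} v_{t_r,j}| > n^{-1/4+\delta} \}. \]

Then we assume

\[ \Big( \sum_{(i,j) \in L} u^*_{s_r,i}\, x_{ij}\, v_{t_r,j} \Big)_{r \in I^2} \Rightarrow (G^L_r)_{r \in I^2}. \tag{5} \]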


Remark 1. The eigenvectors satisfying the convergence criteria of (2)-(5) are
quite general, and are allowed to be of local, delocal and mixed types (see
Remark 4). These eigenvector requirements are similar to those of [?] and [?].


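We denote the Schur complement of $A$ in the block matrix $\begin{pmatrix} A & B \\ C & D \end{pmatrix}$ by

\[ SC\Big(A, \begin{pmatrix} A & B \\ C & D \end{pmatrix}\Big) := D - C A^{-1} B. \]

Recalling the notation of Theorem 3, we denote the elements of $\Lambda^\theta$ by $\lambda^\theta_s$ for $s \in I_\theta$.
We now state our main theorem.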


Theorem 4. Let $X$ be an iid matrix and $A$ a perturbation matrix. We will assume
the moment hypothesis $E|x|^m < \infty$, with $m$ defined as follows. First define $c$ through

\[ c = \sup\{ c' \ge 0 : \max_{i \in [p]} \|u_i\|_\infty \|v_i\|_\infty \le n^{-c'} \}. \tag{6} \]

Then fix $\epsilon > 0$ and set

\[ m = \min(\max(2/c, 4), 8) + \epsilon. \tag{7} \]

Recalling (1), (2) and (3), we define the random variables $(F_r)_{r \in I^2}$ by

\[ F_r := G_r + g_r, \tag{8} \]

where $(g_r)_{r \in I^2}$ is a collection of centered complex Gaussians independent of $(G_r)_{r \in I^2}$
with mixed second moments specified by

\[ E\, g_{r_1}^{(d_1)} g_{r_2}^{(d_2)} = \frac{(E x^{(d_1)} x^{(d_2)})^2}{\theta_{r_1}^{(d_1)} \theta_{r_2}^{(d_2)} - E x^{(d_1)} x^{(d_2)}}\; U^{(d_1),(d_2)}_{s_{r_1},s_{r_2}}\, V^{(d_1),(d_2)}_{t_{r_1},t_{r_2}}. \tag{9} \]
For $\theta \in \Theta$, let $F^\theta := (F_r)_{\theta_r = \theta}$ be the $I^u_\theta \times I^v_\theta$ matrix of random variables and for
$k \in [K_\theta]$, let

\[ F^{\theta,k} := SC\big( F^\theta|_{\{(s,t) : k_s, k_t \ge k+1\}},\ F^\theta|_{\{(s,t) : k_s, k_t \ge k\}} \big) \]

be the $m_{\theta,k} \times m_{\theta,k}$ matrix that is the Schur complement of the indicated submatrices
of $F^\theta$. Denote the eigenvalues of $F^{\theta,k}$ by $(\lambda_{j,k})_{j=1}^{m_{\theta,k}}$ whose $k$th roots we denote

(10) f!j, k := 

where f k = e £ . Then for each 9 G Q, we can label the eigenvalues in as 

(A® ■ fc)(i,j,fc)ez e such that the normalized outlier fluctuations 

( 11 ) 


converge to $(f_{i,j,k})_{\theta \in \Theta, (i,j,k) \in I_\theta}$ in the following sense. Define the subgroup $S$ of the
permutation group $S_I$ by

\[ S := \{ \pi \in S_I : \pi(s)_\theta = s_\theta,\ \pi(s)_k = s_k \text{ and } \pi(s)_j = \pi(t)_j \iff s_j = t_j \text{ for all } s, t \in I \}. \]

Let $BC(C^I)^S$ denote the set of bounded continuous functions on $C^I$ invariant under
the action of $S$. Then for $f \in BC(C^I)^S$, and writing $(f_i)_{i \in I}$ for (10) and $(\hat{f}_i)_{i \in I}$
for (11),




Remark 2. The moment hypothesis we require seems to be a technical limitation of
the moment method that we have employed. While we need at most $8 + \epsilon$ moments
in all cases, we conjecture that 4 moments always suffice. In the delocal case with
$c = 1$ (i.e., $\|u_i\|_\infty, \|v_i\|_\infty \ll 1/\sqrt{n}$), we require $4 + \epsilon$ moments which almost matches
the conjectured optimal. On the other hand, under the assumption of 4 moments, [6]
obtains the fluctuations of certain types of local matrices (with $c = 0$) as described
in the next subsection.


1.3. Discussion and related works. We now provide examples of different types
of behavior for the fluctuations that illustrate Theorem 4. The first two examples
are of rank 1 perturbations.


(i) If $A$ has a single nonzero entry $\theta$ in the top left with $|\theta| > 1$, the limiting
normalized fluctuation of the outlier is the law of $x + g$ where $x$ is the
atom distribution and $g$ is a centered complex Gaussian with $E g^2 = 0$ and
$E|g|^2 = (|\theta|^2 - 1)^{-1}$. In Figure 2 we demonstrate this non-universality
in the case $\theta = 2$ and $x$ as specified in the captions.

(ii) If $A = \theta v u^*$ is of rank 1 with $|\theta| > 1$ and $\|u\|_\infty \|v\|_\infty = o(1)$, then the
normalized fluctuation $\sqrt{n}(\lambda - \theta)$ converges to the law of a centered complex
Gaussian $g_\theta$ with

\[ E g_\theta^2 = \frac{\theta^2 E x^2}{\theta^2 - E x^2} \lim_{n \to \infty} u^* \bar{u}\, v^T v \]

and

\[ E |g_\theta|^2 = \frac{|\theta|^2}{|\theta|^2 - 1} \lim_{n \to \infty} u^* u\, v^* v. \]

In particular, if $E x^2 = 0$ and $A$ is normal (thus $u$ and $v$ are unit vectors),
then $g_\theta$ is a circularly symmetric Gaussian with variance $\frac{|\theta|^2}{|\theta|^2 - 1}$.

(iii) Suppose $A = UDU^*$ is normal of rank $k$, with $\|u_i\|_\infty = o(1)$ for $i = 1, 2, \dots, k$.
For a fixed eigenvalue $\theta \in \Theta$ of multiplicity $m$, the covariance
formula (9) reduces to

\[ E g_{ab}\, g_{cd} = \frac{\theta^2 E x^2}{\theta^2 - E x^2} \lim_{n \to \infty} u_a^* \bar{u}_c\, u_b^T u_d \]

and

\[ E g_{ab}\, \bar{g}_{cd} = \frac{|\theta|^2}{|\theta|^2 - 1} \lim_{n \to \infty} u_a^* u_c\, u_b^* u_d = \frac{|\theta|^2}{|\theta|^2 - 1}\, \delta_{ac} \delta_{bd}. \]

Note that fluctuations of different eigenvalues are still correlated in general.
We obtain asymptotically independent fluctuations for distinct eigenvalues
in the following cases.

(a) If $A$ is real, $u_a^* \bar{u}_c = \delta_{ac}$, $u_b^T u_d = \delta_{bd}$ and the entries of $F^\theta = (g_{ab})_{a,b=1}^m$
are independent Gaussians. Depending on the Jordan structure $J_\theta$, the
normalized fluctuations converge to the appropriate roots of eigenvalues
of Schur complements of submatrices of $F^\theta$ as specified in Theorem 4.

(b) If $E x^2 = 0$, $F^\theta = (g_{ab})_{a,b=1}^m$ is a scaled complex Ginibre ensemble with
atom distribution $g$ satisfying $E g = 0$, $E g^2 = 0$ and $E|g|^2 = \frac{|\theta|^2}{|\theta|^2 - 1}$.
If we now suppose further that $J_\theta = \theta I_m$, then the $m$ fluctuations
associated to $\theta$ are given by the eigenvalues of the complex Ginibre
ensemble specified above. By the circular law, they lie approximately
uniformly in a disk of radius $\frac{\sqrt{m}\,|\theta|}{(|\theta|^2 - 1)^{1/2}}$ for $m$ large.

(c) So far, the fluctuations have been of order $O(1/\sqrt{n})$. Suppose again
that $E x^2 = 0$ but that $J_\theta$ is a single Jordan block of size $m$. Then as
remarked below Proposition 2, the $m$ fluctuations scaled by $n^{1/(2m)}$ are
given by $(e^{2\pi i j/m} g_\theta^{1/m})_{j=1}^m$ where $g_\theta = (F^\theta)_{m1}$ is the lower left entry
of $F^\theta$. Hence the fluctuations are distributed uniformly around a circle
of radius $n^{-1/(2m)} |g_\theta|^{1/m}$. This dependence of the rate of convergence on
the size of the Jordan block is illustrated by the outliers in Figure 1.
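Case (i) lends itself to a quick Monte Carlo check. The sketch below is an illustration under stated assumptions, not the experiment of the paper: it takes complex Gaussian entries, $\theta = 2$, $n = 100$ and 300 samples, and identifies the outlier as the eigenvalue of largest modulus. If the limit is $x + g$ with $x$ and $g$ independent, $E|x|^2 = 1$ and $E|g|^2 = 1/(|\theta|^2 - 1)$, then the second moment of the normalized fluctuation should be close to $|\theta|^2/(|\theta|^2 - 1) = 4/3$.

```python
import numpy as np

def outlier_fluctuations(theta=2.0, n=100, samples=300, seed=2):
    """Samples of sqrt(n) * (lambda_out - theta) for A = theta * e_1 e_1^*.

    All parameter defaults are illustrative assumptions.
    """
    rng = np.random.default_rng(seed)
    out = np.empty(samples, dtype=complex)
    for s in range(samples):
        X = (rng.standard_normal((n, n)) + 1j * rng.standard_normal((n, n))) / np.sqrt(2)
        M = X / np.sqrt(n)
        M[0, 0] += theta                       # single nonzero entry of A, top left
        eig = np.linalg.eigvals(M)
        lam = eig[np.argmax(np.abs(eig))]      # the outlier has the largest modulus
        out[s] = np.sqrt(n) * (lam - theta)
    return out

if __name__ == "__main__":
    f = outlier_fluctuations()
    print("empirical E|f|^2:", np.mean(np.abs(f) ** 2), " predicted:", 4.0 / 3.0)
```

With Gaussian entries the limit is itself Gaussian, so this run does not display the non-universality; replacing the entry distribution (for instance by the uniform law on a square, as in Figure 2) would.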


In [15], the outlier eigenvalues of perturbations of the single ring model are studied
and their locations and limiting fluctuations are obtained ([15, Theorem 2.9]) for
finite rank and finite operator norm perturbations of arbitrary Jordan type. Note
that the special case of the Ginibre ensemble, which is an iid matrix, is contained in
this model as well. Our approach to dealing with perturbations of various Jordan
types is similar and relies on a deterministic perturbation result known as the
Lidskii-Vishik-Lyusternik perturbation theorem (see [13], [22], [?] and references
therein) which we have reproduced in Appendix A.

Figure 2. Figures 2a and 2c are 500 samples of the normalized
fluctuations $\sqrt{n}(\lambda_{out}(X/\sqrt{n} + A) - 2)$ of a single outlier with $n = 100$
and $A$ given by $a_{ij} = 2\delta_{(i,j) = (1,1)}$. In Figure 2a, the atom
distribution $x$ is distributed uniformly over the square $[-l, l]^2 \subset C$
with $l = \sqrt{3/2}$ so that $E|x|^2 = 1$ (outlined in figure). In Figure
2c, $x$ is the standard complex normal $N(0,1)_C$. Figures 2b and 2d
are 500 samples from the corresponding limiting distributions as
predicted by Theorem 4 and detailed in case (i).


In [6], Bordenave and Capitaine study asymptotic outlier locations and fluctuations
for perturbed iid matrices. The perturbations considered there are of the form
$A = A' + A''$ where $A''$ is of bounded rank and $A'$ (with possibly unbounded rank)
satisfies a well-conditioning property. In the case of local perturbations, where $A$
has a finite nonzero block $A''$ at the top-left, [6, Theorems 1.7 and 1.8] obtain the
limiting normalized outlier fluctuation when $A'' = \theta I_{rk(A)}$ and when $A'' = J_{\theta, rk(A)}$
under the hypothesis of bounded fourth moments.

In the case when $A'' = vu^*$ is of rank 1 and is delocalized ($\|u\|_\infty, \|v\|_\infty = O(n^{-1/2})$),
they show that the outliers exhibit macroscopic fluctuations and demonstrate
a convergence of these fluctuations to the zeros of a Gaussian analytic function.
While this phenomenon does not occur with finite rank perturbations, some
techniques of the proof are similar to the ones in our proof.

In the setting of finite rank perturbations of iid matrices, when Theorem 4 is
specialized appropriately, our results coincide with [15, Theorem 2.9] for the Ginibre
ensemble and with [6, Theorems 1.7 and 1.8] for local perturbations of the specified
Jordan types. All other cases however, with $X$ having a non-Gaussian atom
distribution and $A$ having general eigenvectors (see Remark 1), including the
delocalized cases of (ii) and (iii), do not appear to have been explicitly addressed in
the literature.

The main technical result of this paper is Proposition 1, which we prove using the
moment method. We require a bounded number of moments in all cases and are
able to obtain the limiting fluctuations in a more general setting with a unified
approach.

The paper is organized as follows. In Section 2 we prove Proposition 1, which
characterizes the joint asymptotic distribution of certain random variables arising
from powers of $X_n$ appearing in the Neumann series of $(X_n/\sqrt{n} - \lambda)^{-1}$. In
Section 3 we prove Lemma 7, which determines the joint limiting distribution of
random variables related to a normalized resolvent of $X_n$, namely of the form
$\sqrt{n}\, u^*[(X_n/\sqrt{n} - \lambda)^{-1} + \lambda^{-1}]v$. Using Lemma 7, Theorem 4 is proven in Section 4
with the help of Proposition 2 from Appendix A, a deterministic perturbation result
needed to understand the effect of Jordan blocks in perturbations. Appendix B
presents the truncation argument that allows us to assume stronger hypotheses in
Proposition 1 and Lemma 7.


1.4. Acknowledgments. I am indebted to my advisor, Terence Tao, for his constant
guidance, support and feedback throughout the course of this work.


1.5. Notation. In this paper, $n$ will be a parameter going to infinity and many
quantities will be implicitly understood to depend on $n$. We will use the asymptotic
notation $X = O(Y)$ and $X \ll Y$ to mean there is a constant $C$ independent of $n$,
but possibly dependent on other parameters, such that $X \le CY$ for sufficiently
large $n$. Similarly, we write $X = \Omega(Y)$ to mean for some $C$ and sufficiently large
$n$, $X \ge CY$. We write $X = o(Y)$ to mean $\lim_{n \to \infty} X/Y = 0$. For a sequence of
events $E = E_n$, we say $E$ occurs with high probability (w.h.p.) if $P(E_n) = 1 - o(1)$
and with overwhelming probability if $1 - P(E_n) \ll n^{-c}$ for all $c > 0$. We will use
$\Rightarrow$ to denote convergence in distribution (and occasionally to denote implication)
and finally, we write $[k]$ for $\{1, 2, \dots, k\}$.

2. A central limit theorem


To obtain the limiting fluctuations of the outliers in Theorem 4, we will have
to derive the joint asymptotic distributions for certain bilinear averages of the
recentered and normalized resolvent, namely for

\[ S^{u,v}_\lambda := -\lambda \sqrt{n}\, u^* \big( (X/\sqrt{n} - \lambda)^{-1} + \lambda^{-1} \big) v \tag{12} \]

with $u$ and $v$ ranging over the generalized eigenvectors of the perturbation matrix
$A$. To this end, in this section we prove Proposition 1, which obtains the limiting
joint distribution for a bounded number of terms of the Neumann series of (12). In
Lemma 7 we will control the tail of (12), thus obtaining its limiting distribution.
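The Neumann series referred to here is the standard expansion $(M - \lambda)^{-1} + \lambda^{-1} = -\sum_{j \ge 1} \lambda^{-(j+1)} M^j$, valid when $|\lambda|$ exceeds the operator norm of $M$. The following minimal sketch checks it numerically; the function names, the matrix size and the normalization of $M$ to operator norm 1 are assumptions made only so that convergence is guaranteed in the example.

```python
import numpy as np

def neumann_partial_sum(M, lam, terms):
    """Partial sum -sum_{j=1}^{terms} lam^{-(j+1)} M^j of the recentered resolvent."""
    n = M.shape[0]
    total = np.zeros((n, n), dtype=complex)
    power = np.eye(n, dtype=complex)
    for j in range(1, terms + 1):
        power = power @ M                      # power now equals M^j
        total -= power / lam ** (j + 1)
    return total

def recentered_resolvent(M, lam):
    """(M - lam)^{-1} + lam^{-1}, computed by direct inversion."""
    n = M.shape[0]
    return np.linalg.inv(M - lam * np.eye(n)) + np.eye(n) / lam

def neumann_error(n=50, lam=2.0 + 0.5j, terms=60, seed=3):
    """Operator-norm distance between the partial sum and the exact resolvent."""
    rng = np.random.default_rng(seed)
    X = (rng.standard_normal((n, n)) + 1j * rng.standard_normal((n, n))) / np.sqrt(2)
    M = X / np.linalg.norm(X, 2)               # normalize so that ||M|| = 1 < |lam|
    diff = neumann_partial_sum(M, lam, terms) - recentered_resolvent(M, lam)
    return float(np.linalg.norm(diff, 2))

if __name__ == "__main__":
    print(neumann_error(terms=2), neumann_error(terms=60))
```

The error decays geometrically in the number of terms, which is why a bounded number of terms plus a tail estimate suffices.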


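Recall the notation introduced in (1) which we reproduce here for convenience.
For any complex vector $z$, we let

\[ z^{(d)} := \begin{cases} z & : d = 0 \\ \bar{z} & : d = 1 \end{cases} \]

For $S \subset [n] \times [n]$, we define $X_S = (X^S_{ij})$ through

\[ X^S_{ij} := x_{ij}\, 1_{(i,j) \in S}. \]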

Proposition 1. Let $X$ be an iid matrix and $(u_i^*, v_i)_{i=1}^p = (u_i^{(n)*}, v_i^{(n)})_{i=1}^p$ be a
sequence of vectors in $C^n$. We assume the hypotheses of Theorem 4 with $(u_i, v_i)_{i=1}^p$
in the place of $(u_{s_r}, v_{t_r})_{r \in I^2}$. Thus, in the place of (2) and (3), we assume the
following limits and define the scalars

\[ C^{(d_1),(d_2)}_{i_1,i_2} := \lim_{n \to \infty} (u_{i_1}^*)^{(d_1)} \big( (u_{i_2}^*)^{(d_2)} \big)^T \big( v_{i_1}^{(d_1)} \big)^T v_{i_2}^{(d_2)}. \tag{13} \]

We will assume $E|x|^m < \infty$ with $m$ defined via (6) and (7).

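Define

\[ Z_{i,j} := \sqrt{n}\, u_i^* \Big( \frac{X}{\sqrt{n}} \Big)^j v_i \]

where we have suppressed the $n$ dependence for $X_n$, $u_i^{(n)}$ and $v_i^{(n)}$. Also, for

\[ L := \bigcup_{i \in [p]} \{ (k,l) \in [n] \times [n] : |u_{i,k} v_{i,l}| > n^{-1/4+\delta} \} \]

and $L^c := ([n] \times [n]) \setminus L$, define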


zij ■■ 


x L ' 3 


Z% ■= Vnu* ( I Vi. 


and 
For $j = 1$, we will assume the following joint convergences in distribution and
define the independent families $(G^L_{i,1})_{i=1}^p$ and $(G^{L^c}_{i,1})_{i=1}^p$ through

\[ (Z^L_{i,1})_{i=1}^p \Rightarrow (G^L_{i,1})_{i=1}^p \]

and

\[ (Z^{L^c}_{i,1})_{i=1}^p \Rightarrow (G^{L^c}_{i,1})_{i=1}^p. \]

Also define $G_{i,1} := G^L_{i,1} + G^{L^c}_{i,1}$ so that

\[ (Z_{i,1})_{i=1}^p \Rightarrow (G_{i,1})_{i=1}^p. \tag{14} \]

Then for any fixed $m \ge 1$, the $pm$ random variables $(Z_{i,j})_{i=1,j=1}^{p,m}$ converge jointly in
distribution to the law of random variables $(G_{i,j})_{i=1,j=1}^{p,m}$ with $(G_{i,j})_{i=1,j=2}^{p,m}$ specified
by

(i) The $G_{i,j}$'s are centered complex Gaussians for $j \ge 2$ with mixed second
moments given by

\[ E\, G^{(d_1)}_{i_1,j} G^{(d_2)}_{i_2,k} = \delta_{jk}\, (E x^{(d_1)} x^{(d_2)})^j\, C^{(d_1),(d_2)}_{i_1,i_2}. \tag{15} \]

(ii) The collections of random variables $(G_{i,1})_{i=1}^p$ and $(G_{i,j})_{i=1,j=2}^{p,m}$ are
independent.

Note in particular that for $j \ne k$, $Z_{i_1,j}$ and $Z_{i_2,k}$ are asymptotically independent.

Remark 3. We note that the case $p = 1$ and $c = 1$ is a generalization of [?,
Section 4] to the complex case with weaker moment assumptions, and is a special
case of [?, Theorems 6.3, 6.4].

Remark 4. The assumption of the joint convergence of $Z^L_{i,1}$ and $Z^{L^c}_{i,1}$ is satisfied
under various conditions. We describe some of these below.

(i) If each $u_i$ and $v_i$ have finite support in $[C]$ independent of $n$, we have the
case of a local perturbation and the $G_{i,1}$'s are finite linear combinations of
the $x_{ij}$'s.

(ii) If each $u_i$ and $v_i$ is uniformly delocalized in the sense that $\|u_i\|_\infty = o(1)$
and $\|v_i\|_\infty = o(1)$ for $i \in [p]$, then by the classical central limit theorem,
the $G_{i,1}$'s are joint centered complex Gaussians with mixed second moments
given by

\[ E\, G^{(d_1)}_{i_1,1} G^{(d_2)}_{i_2,1} = E x^{(d_1)} x^{(d_2)}\, C^{(d_1),(d_2)}_{i_1,i_2}. \]

(iii) Each $u_i$ and $v_i$ can be allowed to have a local and a uniformly delocalized
part. Namely, we suppose that for some $C$ independent of $n$ and all $i \in [p]$,
$\sup_{j > C} |u_{i,j}|, \sup_{j > C} |v_{i,j}| = o(1)$. In this case, the $G_{i,1}$'s are a sum of a finite
linear combination of the $x_{ij}$'s and an independent Gaussian.

(iv) Finally, we mention an example that is not contained in the above cases.
Let $p = 1$, fix $0 < r < 1$ and set $u_{1,k} = v_{1,k} = r^k c_n$ with $c_n$ chosen such
that $u^* v = \theta := 2$ say. Then $G_{1,1}$ is an infinite linear combination of the
$x_{ij}$'s with exponentially decreasing entries.
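The delocalized case (ii) can be probed by simulation. The sketch below is an illustration under stated assumptions: $p = 1$, complex Gaussian entries (so $E x^2 = 0$), $Z_j = \sqrt{n}\, u^*(X/\sqrt{n})^j v$, and a hypothetical choice of orthogonal unit vectors $u$ (constant) and $v$ (a Fourier mode). With unit vectors and $E|x|^2 = 1$, the second moment $E|Z_j|^2$ predicted by (15) is 1 for $j = 1, 2$.

```python
import numpy as np

def sample_Z(j, n=100, samples=400, seed=4):
    """Samples of Z_j = sqrt(n) * u^* (X/sqrt(n))^j v for delocalized unit vectors u, v."""
    rng = np.random.default_rng(seed)
    u = np.ones(n) / np.sqrt(n)                                # flat unit vector
    v = np.exp(2j * np.pi * np.arange(n) / n) / np.sqrt(n)     # Fourier mode, orthogonal to u
    out = np.empty(samples, dtype=complex)
    for s in range(samples):
        X = (rng.standard_normal((n, n)) + 1j * rng.standard_normal((n, n))) / np.sqrt(2)
        w = v.copy()
        for _ in range(j):
            w = X @ w / np.sqrt(n)                             # w = (X/sqrt(n))^j v
        out[s] = np.sqrt(n) * np.vdot(u, w)                    # vdot conjugates u: u^* w
    return out

if __name__ == "__main__":
    for j in (1, 2):
        z = sample_Z(j)
        print(j, np.mean(np.abs(z) ** 2))
```

Applying the matrix to the vector $j$ times avoids forming $X^j$ explicitly, which keeps each sample at $O(jn^2)$ operations.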
2.1. Proof of Proposition 1. Instead of assuming (7), via a truncation argument
presented in Appendix B it suffices to prove Proposition 1 under the stronger
assumption that the atom distribution $x$ satisfies the bound $|x| \le K := o(n^M)$
with $M = 2/m$ given by

\[ M = \max(\min(c, 1/2), 1/4) - \epsilon, \tag{16} \]

with $c$ defined by (6). Furthermore, by decreasing $c$ slightly (and decreasing $\epsilon$), we
may assume

\[ \max_{i \in [p]} \|u_i\|_\infty \|v_i\|_\infty \le n^{-c} \]

instead. We will also assume without loss of generality that $(u_i, v_i)_{i \in [p]}$ are unit
vectors.
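The relation between the moment exponent $m$ of (7) and the truncation exponent $M$ of (16) is simple arithmetic: ignoring the $\epsilon$ corrections, $2/\min(\max(2/c, 4), 8) = \max(\min(c, 1/2), 1/4)$. A minimal sketch that checks this identity on a few values of $c$ follows; the function names are hypothetical, and the convention $2/c = \infty$ at $c = 0$ is an assumption made for the local case.

```python
def moment_exponent(c, eps=0.0):
    """m = min(max(2/c, 4), 8) + eps, with 2/c read as +infinity when c = 0."""
    ratio = float("inf") if c == 0 else 2.0 / c
    return min(max(ratio, 4.0), 8.0) + eps

def truncation_exponent(c, eps=0.0):
    """M = max(min(c, 1/2), 1/4) - eps."""
    return max(min(c, 0.5), 0.25) - eps

if __name__ == "__main__":
    for c in (0.0, 0.1, 0.25, 0.3, 0.5, 1.0):
        print(c, moment_exponent(c), truncation_exponent(c), 2.0 / moment_exponent(c))
```

The local case $c = 0$ gives $m = 8$ and $M = 1/4$, while the fully delocalized case $c = 1$ gives $m = 4$ and $M = 1/2$, in line with Remark 2.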

In step 1, we show that {Zl\ )/ =1 is asymptotically independent of 

In step 2, we derive the joint asymptotic distribution of {Zjf x )f =1 U j— 2 - 

A key part of the proof is contained in Lemma [5j whose proof we postpone to the 
end of this section. 

Step 2 employs the moment method which, together with the truncation method
(see Appendix B), contributes to the moment hypothesis. The moment hypothesis
decays when the random variables $(Z^L_{i,1})_{i=1}^p$ are dealt with using the moment
method; thus we deal with them separately.

We will need 

Lemma 1. Let $A^{(n)} = (A_1^{(n)}, \dots, A_a^{(n)})$, $B^{(n)} = (B_1^{(n)}, \dots, B_a^{(n)})$ and $C^{(n)} =
(C_1^{(n)}, \dots, C_c^{(n)})$ be sequences of complex vector valued random variables such that

\[ (A^{(n)}, C^{(n)}) \Rightarrow (A, C) \quad \text{and} \quad B^{(n)} \to_P 0. \]

Then $(A^{(n)} + B^{(n)}, C^{(n)}) \Rightarrow (A, C)$. In particular, if $A$ and $C$ are independent,
then $A^{(n)} + B^{(n)}$ and $C^{(n)}$ are asymptotically independent.

Proof. This follows from the Cramér-Wold device (see [5, Chapter 1.7]) and appears
in [?, Exercise 1.4.2]. □


2.1.1. Step 1. For $j \ge 2$, define

\[ Z^{L^c}_{i,j} := n^{-(j-1)/2}\, u_i^* (X - X_L)^j v_i. \]

Note that Z-f ^ and Z\ ^ are functions of disjoint subsets of {x rs : r,s £ [n]} 
and hence, (Zij)(i,j)eD and c are independent. Zf^ and Z^ - 2 are 

independent for the same reason. 

By Lemma 1 it suffices to show that

\[ E = E_{i,j} := n^{-(j-1)/2}\, u_i^* \big( X^j - (X - X_L)^j \big) v_i \to_P 0 \tag{17} \]

for $i \in [p]$ and $2 \le j \le m$. We will need the following result.
Lemma 2. Let $u$ and $v$ be unit vectors in $C^n$ and $X$ be an iid random matrix with
atom distribution having mean 0, variance 1 and bounded fourth moment. Then

\[ E \Big| u^* \Big( \frac{1}{\sqrt{n}} X \Big)^k v \Big|^2 = O\Big( \frac{1}{n} \Big) \tag{18} \]

for any fixed $k \ge 1$.


Remark 5. Lemma [2] is a special case of Lemma [5] which establishes the same 
statement for k that is allowed to grow polynomially with n. We postpone the proof 
to Subsection \3.1\ where the result is needed in full generality. We remark that 
Lemma can also be found in HU Lemma 2.3]. 


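Fix $j \ge 2$ and let $S_n = \log n$ (any slowly growing function of $n$ will suffice). By
Lemma 2 and Markov's inequality, for any $k \ge 1$,

\[ \Big| u_m^* \Big( \frac{X}{\sqrt{n}} \Big)^k v_m \Big| \le \frac{S_n}{\sqrt{n}} \tag{19} \]

occurs with high probability for any finite set of $2M$ unit vectors $(u_m)_{m=1}^M$ and
$(v_m)_{m=1}^M$.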

Recall that

\[ L := \bigcup_{i \in [p]} \{ (k,l) \in [n] \times [n] : |u_{i,k} v_{i,l}| > n^{-1/4+\delta} \} \]

where $\delta > 0$ is fixed. Since $|u_i|_2 = |v_i|_2 = 1$, we have $|L| \ll n^{1/2 - 2\delta}$. To control
$\|X_L\|$, we will need

Lemma 3. Suppose $S \subset A \times B$ with $\max(|A|, |B|) \le m$. Then $\|X_S\| \le O(\log n \sqrt{m})$
w.h.p.

Proof. Since $\|X_S\|$ is unchanged when restricting $X_S$ to an $m \times m$ submatrix
containing $S$, we may assume $m = n$. If $S^c = \emptyset$, Lemma 3 is a consequence of Theorem 2.
Writing $X' := X_S - X_{S^c}$, we have

\[ \|X_S\| \le \tfrac{1}{2} ( \|X\| + \|X'\| ) \]

from the triangle inequality. If the atom distribution $x$ is symmetric, applying
Theorem 2 to $X$ and $X'$ yields the desired bound. To prove the lemma for general $x$,
we will need a symmetrization argument from [?, Section 2.3.2] that we reproduce
here for convenience. Letting $X''$ be an independent copy of $X'$, we have

\[ E[X' - X'' \mid X'] = X'. \]

Since the operator norm is a convex function, we may apply Jensen's inequality to
get

\[ \|X'\| \le E[\, \|X' - X''\| \mid X' \,]. \]

Removing the conditioning on $X'$, we have

\[ E\|X'\| \le E\|X' - X''\|. \]

Now $X' - X''$ has iid entries, so applying Theorem 2 we have

\[ P[\, \|X'\| > \log n \sqrt{n} \,] \le \frac{E\|X'\|}{\log n \sqrt{n}} \le \frac{E\|X' - X''\|}{\log n \sqrt{n}} = o(1). \]

□

□ 


Applying Lemma 3 with $m = n^{1/2 - 2\delta}$ gives

\[ \|X_L\| \le (\log n)\, n^{1/4 - \delta} \quad \text{w.h.p.} \tag{20} \]


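Now let

\[ X_a := \begin{cases} X & : a = 0 \\ X_L & : a = 1 \end{cases} \]

Expanding (17), we have

\[ |E| \le \sum_{a=1}^{j} \sum_{\substack{a_1, \dots, a_j \in \{0,1\} \\ \sum_i a_i = a}} n^{-(j-1)/2}\, | u_i^* X_{a_1} \cdots X_{a_j} v_i | =: \sum_{a=1}^{j} E_a. \]

For $a \ge 2$,

\[ E_a \le \binom{j}{a} \Big\| \frac{X}{\sqrt{n}} \Big\|^{j-a} \Big\| \frac{X_L}{\sqrt{n}} \Big\|^{a-1} \|X_L\| = o(1) \quad \text{w.h.p.,} \]

where we have used (20) and that $a \ge 2$.

To bound $E_1$, we have

\[ E_1 \le \sum_{m=0}^{j-1} n^{-(j-1)/2}\, | u_i^* X^m X_L X^{j-1-m} v_i | \le \sum_{(k,l) \in L} \sum_{m=0}^{j-1} |x_{kl}|\, \Big| u_i^* \Big( \frac{X}{\sqrt{n}} \Big)^m e_k \Big|\, \Big| e_l^* \Big( \frac{X}{\sqrt{n}} \Big)^{j-1-m} v_i \Big| \ll \frac{S_n}{\sqrt{n}} \sum_{(k,l) \in L} |x_{kl}| \quad \text{w.h.p.} \]

Note that if $j \ge 2$, then either $m \ge 1$ or $j - 1 - m \ge 1$ for $0 \le m \le j - 1$. Hence
the last line follows from (19).

Since $E|x_{kl}| \le 1$, $S_n = \log n$ and $|L| = O(n^{1/2 - 2\delta})$, we have $E_1 \to_P 0$ by Markov's
inequality, and (17) follows.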
2.1.2. Step 2. We first state and prove the complex version of Wick's theorem (also
known as Isserlis' theorem, see [?]) which will be needed later.

Lemma 4. (Complex Wick's theorem)
Let $(Z_1, Z_2, \dots, Z_n) = (X_1 + iY_1, \dots, X_n + iY_n)$ be a centered complex Gaussian
vector. Thus the vector $(X_1, Y_1, \dots, X_n, Y_n)$ is multivariate normal. Then for any
$I = (i_1, \dots, i_{2k}) \in [n]^{2k}$,

\[ E \prod_{l=1}^{2k} Z_{i_l} = \sum_{P} \prod_{j=1}^{k} E\, Z_{i_{p_{2j-1}}} Z_{i_{p_{2j}}} \]

where the sum is over all partitions $P = \bigcup_{j=1}^{k} \{p_{2j-1}, p_{2j}\}$ of $[2k]$ into pairs. Also,
the left hand side is 0 if $I$ has odd length.

Proof. Wick's theorem is the statement of the lemma for multivariate centered real
Gaussians. The complex version follows by expanding both sides of the equation
into real and imaginary parts and applying Wick's theorem. Let

\[ W_l^{a} := \begin{cases} X_{i_l} & : a = 1 \\ iY_{i_l} & : a = 2 \end{cases} \]

Then

\[ E \prod_{l=1}^{2k} Z_{i_l} = \sum_{a_1, \dots, a_{2k} \in \{1,2\}} E \prod_{l=1}^{2k} W_l^{a_l} \]

while

\[ \sum_{P} \prod_{j=1}^{k} E\, Z_{i_{p_{2j-1}}} Z_{i_{p_{2j}}} = \sum_{P} \sum_{a_1, \dots, a_{2k} \in \{1,2\}} \prod_{j=1}^{k} E\, W_{p_{2j-1}}^{a_{p_{2j-1}}} W_{p_{2j}}^{a_{p_{2j}}}. \]

Switching the sums and applying Wick's theorem to $E \prod_{l=1}^{2k} W_l^{a_l}$ for each choice of
the $a_l$'s yields the result.


□ 
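The pairing sum in Lemma 4 is easy to evaluate mechanically. The following minimal sketch enumerates all partitions of an index list into pairs and sums the products of second moments; the function names and the convention that `pseudo_cov[a][b]` stores $E Z_a Z_b$ (without conjugation) are assumptions of this illustration.

```python
def pairings(items):
    """Yield all partitions of the list `items` into unordered pairs."""
    items = list(items)
    if not items:
        yield []
        return
    first, rest = items[0], items[1:]
    for pos, partner in enumerate(rest):
        remaining = rest[:pos] + rest[pos + 1:]
        for tail in pairings(remaining):
            yield [(first, partner)] + tail

def wick_moment(pseudo_cov, idx):
    """E[prod_l Z_{idx[l]}] for a centered Gaussian vector via the pairing formula.

    pseudo_cov[a][b] is assumed to hold E[Z_a Z_b] (no complex conjugation).
    """
    if len(idx) % 2 == 1:
        return 0.0                       # odd mixed moments vanish
    total = 0.0
    for P in pairings(range(len(idx))):
        term = 1.0
        for a, b in P:
            term *= pseudo_cov[idx[a]][idx[b]]
        total += term
    return total

if __name__ == "__main__":
    # Real standard Gaussian: fourth moment equals 3.
    print(wick_moment([[1.0]], [0, 0, 0, 0]))
    # Vector (Z, conj(Z)) for a standard complex Gaussian Z: E|Z|^4 equals 2.
    print(wick_moment([[0.0, 1.0], [1.0, 0.0]], [0, 0, 1, 1]))
```

Treating $(Z, \bar{Z})$ as a two-component centered complex Gaussian vector is what lets the same formula handle moments that mix $Z$ and its conjugate, as needed for (22).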


We now prove Proposition 1 for the collection of random variables $(Z^{L^c}_{i,1})_{i=1}^p \cup
(Z^{L^c}_{i,j})_{i=1,j=2}^{p,m}$. This part of the proof employs the moment method in a similar way
to those in [20] and [6]. To avoid notational clutter on a first reading, one may set
$p = 1$ to grasp the main ideas of the proof.

To handle the $j = 1$ case uniformly, in the proof we will abuse notation by writing
$Z_{i,1}$ for $Z^{L^c}_{i,1}$ and $G_{i,1}$ for $G^{L^c}_{i,1}$. When $j = 1$, we will denote $X_{L^c}$ by $X$ and finally,
we define

\[ C^{(d_1),(d_2)}_{i_1,i_2}(j) := \begin{cases} C^{(d_1),(d_2)}_{i_1,i_2} & : j \ge 2 \\ \lim_{n \to \infty} \sum_{(k,l) \in L^c} (u^*_{i_1,k})^{(d_1)} (u^*_{i_2,k})^{(d_2)} (v_{i_1,l})^{(d_1)} (v_{i_2,l})^{(d_2)} & : j = 1. \end{cases} \tag{21} \]
By Carleman's theorem for the case of a complex vector of random variables (see
e.g. [?]), it suffices to show that the multivariate mixed moments converge. Namely,

\[ E \prod_{\substack{1 \le i \le p \\ 1 \le j \le m}} Z_{i,j}^{r_{i,j}}\, \bar{Z}_{i,j}^{s_{i,j}} = E \prod_{\substack{1 \le i \le p \\ 1 \le j \le m}} G_{i,j}^{r_{i,j}}\, \bar{G}_{i,j}^{s_{i,j}} + o(1) \tag{22} \]

for $(r_{i,j})_{i=1,j=1}^{p,m}, (s_{i,j})_{i=1,j=1}^{p,m} \in N^{pm}$.

Let $Q_1 = \frac{1}{2} \sum_{i,j} (j - 1)(r_{i,j} + s_{i,j})$. Then the left hand side of (22) is

\[ n^{-Q_1}\, E \prod_{\substack{1 \le i \le p \\ 1 \le j \le m}} (u_i^* X^j v_i)^{r_{i,j}}\, \big( \overline{u_i^* X^j v_i} \big)^{s_{i,j}}. \tag{23} \]

Expanding the product in (23) will yield terms corresponding to the union of
directed paths on the vertex set $[n]$ with $\sum_i (r_{i,j} + s_{i,j})$ of them having length $j$
for each $1 \le j \le m$. We first introduce notation in order to write (23) as a sum
$n^{-Q_1} \sum_F W(F)$, with $F$ and $W(F)$ defined appropriately. Next, we reduce the
sum to terms with paths having multiplicity two and disjoint interior vertices (see
Lemma 5). Finally we apply the complex Wick theorem to obtain the proposition.

Let

\[ S := \{ (a,b,c,d) : a \in [p],\ b \in [m],\ d \in \{0,1\},\ c \in [r_{a,b}] \text{ if } d = 0 \text{ and } c \in [s_{a,b}] \text{ if } d = 1 \} \]

be the index set for the $Z_{i,j}$'s. For $s \in S$ we write $s = (s_a, s_b, s_c, s_d)$. Recalling (1),
(23) can be written as

(24) 
We let 


n~ Ql E n« 0 XSbt O (Sd) - 


sGS 


T := {(s,e) : s £ S and e € [sb+i]} 


be the index set of terms within the Zij’ s. For t £ T. we write 

t — (^s,^e) — (^a, ^b, tc: tdi ^e)- 

By a slight abuse of notation, we will write u t for u ta and u s for u Sa . We denote 
the index set for terms in the expansion of (24) by 

F := {F : T -A [n] : t b = 1 =► (F(t, 1 ),F(t, 2)) e L c }. 

Finally for s £ S and F £ F' let 

(25) 

Sb 

Ws(F) := (m Si f( Si i)We(s,S!>+1)1[s(,> 2 or (F(s,l),F(s,2))eL c ])^ (® Jf x F(s,e) ,F(s,e+l )) ^ d ^ 

e=l 

(26) 

=: W a>M (F)W., x {F) 

and set 

(27) W u ,v(F) ■= n W 3j ( UtV )(F), 

ses 
and

(28) 


W X (F) := ll W S:X (F) 
ses 

W(F) := W S (F). 


Now we can write (241 as 

(29) 

sGS 


ses 


= n Q 


1 E w ( F )- 


FeF' 


For each partition T = {Xi,..., T q } of T, set 

F t ~{FgF: {F~\i) : * £ [n], F -1 (i) ? 0} = {I\ ..., T q }} 

to be the set of terms F whose preimages induce the partition {T-\.... ,T q }. We 
can now write 


,-Q 


= E w ( p )- 


FeF 


r={T lx ..,T q } F&J^t 


We now define notation for the edges of the graph induced by the terms F. First, 
let E := {(t, t') £ T 2 : t s = t' s , t' e = t e + 1} and fix a partition T = {Xi ..., T q } of 
T. For F £ Ft and i,j £ [g] = [g(T)]> let 

Ejj := {e = £ E : t £ Tj and t' £ X,} 

and let 

E T := {*£• : |£&| > 0}. 

Note that (|e|) e eE T is independent of F £ Ft and that 

(30) W X (F)= H E\x\^. 

eeE T 

Since E|x| = 0, W X (F) = 0 if |e| = 0 for any e £ Et■ Thus defining 

F := U F t , 

1 ~ partition of T: 

|e|>2 WeeE r 

we have 

(31) n~ Ql E W ( F ) = n ~ Ql E W ( F )- 

FeF' fef 


Each F £ F can be interpreted as a union of paths on [n]. More precisely, letting 
T s := {t £ T : t s = s}, we define 7 tf, s '■= X|r s to be the path of F corresponding 
to term s £ 5. The interior vertices of 7 tf, s are defined to be F({(s,e) : e = 
2, 3,..., Sb}). 

Lemma 5. Assume the hypotheses of Proposition^ and recall the notation intro¬ 
duced above. Let Fq be the set of terms F such that each path iTp s for s £ S has 
multiplicity 2 and different paths have disjoint interior vertices. 
Then

n~ Ql W ( F ) = n ~ Ql E W ( F ) + °( 1 )' 

FgF FgF 0 


We will postpone the proof of the lemma to the end of the section. Assuming the 
lemma, we now prove the proposition. 


First suppose $\sum_i (r_{i,j} + s_{i,j})$ is odd for some $j$. Then $\mathcal{F}_0$ is empty and the left-hand
side of (22) is $o(1)$ which matches the right-hand side by the vanishing of odd mixed
moments of a centered complex Gaussian. For the rest of the proof, we can thus
assume that for each $j$, $\sum_i (r_{i,j} + s_{i,j})$ is even.

We group the terms in $\mathcal{F}_0$ as follows. Let $S_j := \{ s \in S : s_b = j \}$ and define $\mathcal{P}_j$
to be the set of unordered partitions of $S_j$ into parts of size two. Note that by
assumption, $|S_j|$ is even for all $j$.

For $F \in \mathcal{F}_0$, note by (25) and (28) that $W(F)$ does not depend on the interior
points $\{ F(s,e) : s \in S,\ e = 2, \dots, s_b \}$. There are $\sum_{i,j} (j - 1)(r_{i,j} + s_{i,j})$ such points
which occur in pairs and can be chosen in $n^{Q_1}$ ways.

For $F \in \mathcal{F}_0$ and $j \in [m]$, let $P_{F,j} \in \mathcal{P}_j$ be the partition of $S_j$ induced by $F$. Then
$F$ satisfies the condition that for each part $\{p,q\} \in P_{F,j}$, $F(p,1) = F(q,1)$ and
$F(p, p_b + 1) = F(q, q_b + 1)$.

Summing over the choices for interior points and $F$ satisfying the above condition
instead of summing over $F \in \mathcal{F}_0$ incurs an $o(1)$ error and we have


(32) 


■,-Q 


1 ^W(F) = []E II E W p (F)W q (F) + o(l) 


FGFo 


7=1 PjeVj {p,q}£Pj F(p,l)=F(q,l), 

F(p,j+l)=F(q,j + l)e[n] 


where, recalling (25) 


Wp(F)W q (F) = (Ex^x^y n <F(„ 1 )Ef(e 7 + 1) 1 [7>2 or (F(r,l),F(r, 2 ))eLc 

re{p,q} 


Finally, using (13) and (21), (32) evaluates to 


( 33 ) n e n (e x M x lqd) yc£f} qd) (j) + o(i). 

7=1 p j £Pj {p,q}e p j 


On the other hand, we let V be the set of partitions of S into pairs and for s £ S, 
we set G s := Note that for j ^ k, E G ilt jG i2 ^ = 0 and hence G ilt j and G i2: k 
are independent. Applying Wick's theorem to the right-hand side of (22) gives


e n <ZjG?f=®i[G a 


1 <i<p 

1<j <m 


ses 

m 

=n E ii G * 

3 — 1 sESj 
m 

=n e n eg p°« 

J =1 Pj^i {p,q}£Pj 


where we have used Wick’s theorem in the third line. Comparing (33) and (15) 
then concludes the proof of the proposition. 

Note the following special cases of Proposition [IJ where we write Gyi for Gf'J. 


(i) If Ex 2 = 0, condition (15) becomes 


(34) 


EGiLjGh,* = <y j -*C' (0) - (1) (t lj i 2 ) 


and 


EGi lt jGi 2t h — 0 . 

(ii) If we further assume that for p = d 2 , the vectors (itj, Vi) p i=1 are of the form 


(u a , Ub)a 5 =1 with (u a )a =1 orthonormal, then (34) reduces to 


(35) 


G( C)C ;y — djk&ab&cd- 


2.2. Proof of Lemma 5. Fix a partition $\mathcal{T} = \{T_1, \ldots, T_q\}$ of $T$ with $|e| \ge 2$ for
every $e \in E_{\mathcal{T}}$. We first rewrite the sum $\sum_{F \in \mathcal{F}_{\mathcal{T}}} W(F)$ as a product of terms
over $j \in [q]$.

Define $T^1 := \{t \in T : t_e = 1\}$, $T^2 := \{t \in T : t_e = t_b + 1\}$, $T^3 := T \setminus (T^1 \cup T^2)$ and
let $T_j^l := T_j \cap T^l$ for $l = 1, 2, 3$. For $t \in T$ and $i \in [n]$, define the vertex weights

$$w(t,i) := \begin{cases} |u_{t,i}| & : t \in T^1 \\ |v_{t,i}| & : t \in T^2 \\ n^{-1/2} & : t \in T^3 \end{cases} \tag{36}$$


The $w(t,i)$'s account for the factors $n^{-Q_1}$ and $W_{u,v}(F)$ in (27) and (29) respectively.
Since $\mathbb{E}|x|^a \ll K^{(a-4)_+}$, using (30) we have


■,-Q i 


E ^( F ) 

Fe:F m. T q y 


^ e nnn i<(v 


1-4)4 


distinct 


eeE r 


( 37 ) 


< e nn-(Mi) n* (iBM)+ - 

«i,...,i,e[n] \j=lteTj ) e£E r 
We would like to bound $\prod_{e \in E_{\mathcal{T}}} K^{(|e|-4)_+}$ by $\prod_{t \in T} K^*(t, i_{j(t)})$ for some suitably
defined $K^*$ in order to bound the right-hand side of (37) by

$$\sum_{i_1, \ldots, i_q \in [n]} \prod_{j=1}^{q} \prod_{t \in T_j} w(t, i_j) K^*(t, i_j).$$

We do this first for the expression W eeEr in order to motivate some of the 
technical definitions. Fix £ [n] and assume for t £ T and j £ [g] that 

\ut,ij\,\ v t,ij \ 7^ 0. Recall the parameter c £ [0,1] from 
and t 2 £ Tj, define 

(K~ (1_e) l :ieTl 

(38) K(t,i) := Ot;-g- e) | :t£T 2 

[ma x(K 2 n~ c ^ 1 ~ e \ K) : t £ T 3 

We first show that 

(39) n* w «nn*(M,). 

eGE r j=lteTj 


2.1). For t £ Tj, V £ 


T J 


Fix $s \in S$. Suppose $s_b = 1$. Then for $\delta$ and $\epsilon$ sufficiently small,


Y\ K(t,ij( t ))> f min "\ut t kV t ,i\ (1 e) 


teT:t s =s 


(40) 


(fc,J)6i c 

n max ( 1 / 4 - <5 > c )( 1 - e ) 
> K. 


The last line follows from $M < \max(1/4, c)$, which is a consequence of (16).
If $s_b \ge 2$,

n K &m) » ^ 2 » c(1_e) (ll« t ||oolkt||oo) _(1_e) I< sb ~ 2 

t£T:t s =s 

(41) > K Sb 


where we have used $\|u_t\|_\infty \|v_t\|_\infty \ll n^{-c}$. Using (40) and (41) and taking the
product over $s \in S$ gives (39). We now define $K^*(t,i)$ in such a way that we have
the analogous bound

$$\prod_{e \in E_{\mathcal{T}}} K^{(|e|-4)_+} \ll \prod_{j=1}^{q} \prod_{t \in T_j} K^*(t, i_j). \tag{42}$$


First, order the elements of $T_j^l = \{t_1^l, t_2^l, \ldots, t^l_{|T_j^l|}\}$ arbitrarily for $l = 1, 2, 3$. We
define the set $C_j \subset T_j$ by the following conditions.

(i) $t_k^3 \in C_j \iff k \le 2$.

(ii) For $l = 1, 2$, $t_k^l \in C_j \iff k + |T_j^3| \le 2$.

It is easy to verify that $|C_j \setminus T_j^1|, |C_j \setminus T_j^2| \le 2$. We now define

$$K^*(t,i) := \begin{cases} 1 & : t \in C_j \\ K(t,i) & : \text{otherwise}. \end{cases}$$

We now prove (42). Fix $e \in E_{\mathcal{T}}$ and suppose $e \subset T_i \times T_j$. Define $e' \subset e$ by
$e' := \{(s,t) \in e : s \in C_i \text{ or } t \in C_j\}$.

Since $|C_i \setminus T_i^2|, |C_j \setminus T_j^1| \le 2$, $|e'| \le 4$ and we have


It thus suffices to show 




eer 


eer 


n afvi < n n k *^ 

j —1 


eer 


As in the proof of (39), we fix $s \in S$. Let $C := \bigcup_{j \in [q]} C_j$ and define

$$e_s = \{((s,l), (s,l+1)) : 1 \le l \le s_b \text{ and } (s,l), (s,l+1) \notin C\}$$

and

$$v_s = \{(s,l) : (s,l) \notin C \text{ and } l = 2, 3, \ldots, s_b\}.$$

Since $K(t,i) \ge 1$ for $t = (s,1)$ and $t = (s, s_b+1)$, it suffices to show

$$K^{|e_s|} \le \prod_{t \in v_s} K(t,i).$$


If $|e_s| = s_b$, this follows from (40) and (41). Now suppose $|e_s| < s_b$. We first show
that $|e_s| \le |v_s|$. Choose $l^*$ such that $(s, l^*) \in C$ and define the map $f : e_s \to v_s$ by

$$f((s,l),(s,l+1)) := \begin{cases} (s, l+1) & : l \le l^* - 2 \\ (s, l) & : l \ge l^* + 1. \end{cases}$$

We see that $f$ is injective and hence $|e_s| \le |v_s|$. Since $K(t,i) \ge K$ for $t \in v_s$, we
have

$$K^{|e_s|} \le K^{|v_s|} \le \prod_{t \in v_s} K(t,i),$$

completing the proof of (42).


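We can now use (42) in (37) to write

$$n^{-Q_1} \sum_{F \in \mathcal{F}_{\mathcal{T}}} W(F) \ll \sum_{i_1, \ldots, i_q \in [n]} \prod_{j=1}^{q} \prod_{t \in T_j} w(t, i_j) K^*(t, i_j) \tag{43}$$

$$= \prod_{j=1}^{q} \sum_{i_j \in [n]} \prod_{t \in T_j} w(t, i_j) K^*(t, i_j) =: \prod_{j=1}^{q} W^*(T_j). \tag{44}$$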
We now fix a part of $\mathcal{T}$, say $T_1$, and consider $W^*(T_1)$. To prove Lemma 5 it suffices
to prove the following.

Lemma 6. (i) $W^*(T_1) = O(1)$.

(ii) If $|T_1^3| \ge 1$, then $|W^*(T_1)| = o(1)$ unless $|T_1^3| = |T_1| = 2$.

(iii) $W^*(T_j) = o(1)$ unless $|e| = 2$ for every $e \in E_{\mathcal{T}}$.


Proof. We first show that

$$w(t,i) K(t,i) = \begin{cases} O(1) & : t \in T^1 \cup T^2 \\ o(1) & : t \in T^3 \end{cases} \tag{45}$$

using (36), (38) and (16). Suppose $t \in T^1$. Then $w(t,i)K(t,i) \le |u_i|^{\epsilon} = O(1)$. We
have a similar bound for $t \in T^2$. Finally, if $t \in T^3$, then

$$w(t,i)K(t,i) = n^{-1/2} \max(K^2 n^{-c(1-\epsilon)}, K).$$

Since $K = o(n^M)$ and $M < \min(1/2, c)$, we have the desired bound. This implies
in particular that for any $D \subset C_1$,

(46) ^vikeii^,')- 

ie[n] t£D 


We have the estimate

$$\sum_{i \in [n]} |u_i|^{\epsilon} = O(n^{1 - \epsilon/2}), \tag{47}$$

which follows from Holder’s inequality. Suppose |Tp| = 1. Then, since each edge 
has multiplicity at least 2, we must have |T' 1 1 |, \Tf\ > 1. Applying (46) with D = 
Ci = we have that for some (u,v), 

n 

W*(T!)<^\ 


l, 2 \Ui\\Vi\ 


2—1 

< 0(n" 1/2 ). 


If |Tp| > 3, then C\ = and 

W*(Ti) < E n _1 A'*(t|, i) 

iG[n] 

= 0 ( 1 ) 

by (45). Finally, suppose iT-fl = 2. Suppose \T(\ > 1. Then from (47), we have 

W*{Ti)< E = o{l). 

iS[n] 

We have a similar estimate if \Tf \ > 1. We conclude that if |T^| > 1, W*(Ti) = o(l) 
unless |Tp| = 2 and |T/| = \Tf\ = 0, in which case W*(Ti) = 0(1). 

We now prove (iii) Assume first that e is an edge incident to distinct vertices, 
say e C Ti x T 2 , and that \e\ > 3. By (ii) we may assume Tf = Tf = 0. Since 
Till |T 2 | > 3, we may choose (sj,tj) G e for i = 1,2, 3 where Sj £ Tf and ti £ Tf 
and let $C_1 = \{s_1, s_2\}$ and $C_2 = \{t_1, t_2\}$. Then bounding $W^*(T_1) W^*(T_2)$ by the
contribution from we have 

2 

W*(Ti)W*(T 2 ) < E IIk^ii wwr 

(i,j)€L° k— 1 

< max \u S3 ,iV t3 ,j\ e E l u *i,ilK 2 .il E KulKjl 
{lj)eLC je[n] 

= o(l). 

We have a similar bound if e is a loop at say Tf. 



in the cases not 


W*(Ti)< E Ws,iUtA 

ie[n] 

= 0 ( 1 ). 


□ 


3. Proof of Lemma 7


Recall the bilinear average of the normalized resolvent introduced in (12) in Section 2.
In this section we control the tail of its Neumann series and, with the help of
Proposition 1, obtain the joint limiting distribution of such terms in Lemma 7. This
is the main ingredient in the proof of Theorem 4, which is presented in the next
section.

Lemma 7. Fix complex numbers $\theta_1, \ldots, \theta_a$ with $|\theta_j| > 1$ for $j \in [a]$ and suppose
$\lambda_j = \lambda_{n,j} \to \theta_j$ in probability as $n \to \infty$. Let $(u_i, v_i)_{i=1}^{p}$ be $p$ pairs of vectors satisfying the
hypotheses of Proposition 1. Let


s.j ■■= £ 




k> 1 




^i 7 k 
\k ' 
k> 1 3 


Recall the definition of (Gi t i)f =1 from Proposition [7] and define centered complex 
Gaussians { 9 i,j)i=i independent of (Gi t i)f =1 with mixed second moments given 
by 

(Ass') IJ.W) _ (Ex^ K l) 7-r( d i)>( d 2)T/-(di),(d 2 ) 

(48) Eg hj 9i/J , - 9A , m , ^ Ex(dl)x(d2)) U ' - ^ , . 

Then 


(Si,j)i= i, i= i =► (^,,)?= Q i J= i 


where 


J = 1 v 

GiA 
To prove the lemma, we split $S_{i,j}$ into three sums as follows. Fix cutoffs $m > 0$
and $T_n = \log^2 n$ ($T_n = \omega(\log n)$ suffices) and define

S it . 


m rr T n — OO „ 

V — + V — i+V^t 

/ y \fc T \k ' Z-* \fc 

fc=l J fc=m+l i k>T n i 


=: ST 


S« 


We define 


m „ 

(so) US=-E-sr 

fc=l J 

where the G% are defined as in the statement of Proposition b Note that TjA is 
independent of n. 


By Proposition [I] and the multivariate version of Slutsky’s theorem (see 0), 

where the joint convergence is over all i £ [p], k £ [m\ and j £ [a]. By the continuous 
mapping theorem, (S(A) => (T^) jointly for i £ [p] and j £ [a]. By the definitions 
of TA in (501 and of in (14) and (15), and by inspecting (48) and (49), we see 
that 


rriA "WOO „ 

^ “i,j 


jointly. 


To prove Lemma 7 it suffices to prove

Lemma 8. (a) $\lim_{m \to \infty} \lim_{n \to \infty} \mathbb{E}|S^B| = 0$ and

(b) $\lim_{m \to \infty} \lim_{n \to \infty} \mathbb{E}|S^C| = 0$,

where we have suppressed the $i$ and $j$ dependence for $S^B_{i,j}$ and $S^C_{i,j}$.


Define the event 

(51) E n := {\X nJ - 0j\< Sj := for all j £ [a]}. 

By hypothesis P(-E„) = 1—o(l) so it suffices to prove Lemmaj7](and hence Lemma|8|) 
on E n . In the following, we fix an index j and set S := . Note that we have 

(52) |A| >1 + ;[(|0|-1). 

We prove Lemma 8(b) first.


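Proof. Recall that on $E_n$, $|\lambda| \ge 1 + 3\delta$ (see (51)). By Theorem 2, $\rho(X/\sqrt{n}) \le 1 + \delta$
w.h.p. and we can choose $l$ such that $\|(X/\sqrt{n})^l\|^{1/l} \le 1 + 2\delta$. We may assume without
loss of generality that these events occur on $E_n$. By submultiplicativity of the
operator norm,

$$\left\| \left( \frac{X}{\sqrt{n}} \right)^k \right\| \le \left\| \left( \frac{X}{\sqrt{n}} \right)^l \right\|^{\lfloor k/l \rfloor} \max_{0 \le i < l} \left\| \left( \frac{X}{\sqrt{n}} \right)^i \right\| \le O_l\big((1 + 2\delta)^k\big) \quad \text{w.h.p.}$$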


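By the Cauchy-Schwarz inequality, we have

$$|S^C| \le \sum_{k > T_n} \frac{\sqrt{n}}{|\lambda|^k} \left\| \left( \frac{X}{\sqrt{n}} \right)^k \right\| |u|_2 |v|_2 \le O_l(\sqrt{n}) \sum_{k > T_n} \left( \frac{1 + 2\delta}{1 + 3\delta} \right)^k = o(1),$$

where the last line follows from our choice of $T_n = \log^2 n$.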
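The role of the cutoff $T_n = \log^2 n$ can be checked numerically: a geometric tail starting at $\log^2 n$ decays like $n^{-c \log n}$, which beats any polynomial prefactor. The sketch below is illustrative only; the function name `weighted_tail`, the value $\delta = 0.1$ used in the check and the prefactor exponent $1/2$ are assumptions chosen for the example, not quantities fixed by the paper.

```python
import math

def weighted_tail(n, delta, prefactor_exponent=0.5):
    """Return n**a * sum_{k > T_n} r**k with r = (1+2*delta)/(1+3*delta)
    and T_n = log(n)**2, using the closed form of the geometric tail."""
    r = (1 + 2 * delta) / (1 + 3 * delta)
    first_k = math.floor(math.log(n) ** 2) + 1  # first index with k > T_n
    tail = r ** first_k / (1 - r)
    return n ** prefactor_exponent * tail
```

Evaluating this at $n = 10^4, 10^6, 10^8$ gives a rapidly decreasing sequence, consistent with the $o(1)$ claim.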


□ 


To prove Lemma 8(a), we will need

Lemma 9. Let $u$ and $v$ be unit vectors in $\mathbb{C}^n$ and set

$$Z_k := \sqrt{n}\, u^* \left( \frac{X}{\sqrt{n}} \right)^k v.$$


Fix e > 0 and assume \x\ < K = 0{n 2 ). Then there exists c = c(e) > 0 such 
that for all k <C n c , 


$$\mathbb{E}|Z_k|^2 = O(1). \tag{53}$$


Assuming Lemma 9, we prove Lemma 8(a) on $E_n$. Since $(\mathbb{E}|Z|)^2 \le \mathbb{E}|Z|^2$, we have

E|S B I< E e!+! 

k=m +1 1 1 

«|i + jj(|0|-i)|- 2m 


where we have used Lemma 9 and (52) in the last line. Lemma 8(a) follows from
letting $m \to \infty$.

Remark 6. Note that by the truncation argument given in Appendix B, Lemma 9,
and hence Lemma 8(a), is valid under the moment hypothesis $\mathbb{E}|x|^{4+\epsilon} < \infty$ for any
fixed $\epsilon > 0$.


3.1. Proof of Lemma 9. In this subsection we prove Lemma 9.


Proof. It suffices to show

$$\mathbb{E}|u^* X^k v|^2 = O(n^{k-1}). \tag{54}$$
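A quick Monte Carlo experiment illustrates the scaling in (54). This is a sketch under simplifying assumptions that are not part of the paper: the entries are taken to be real Rademacher signs, the vectors are the coordinate vectors $e_1$ and $e_n$, and the helper names `bilinear_power` and `normalized_second_moment` are hypothetical. For $k = 1$ the normalized second moment equals 1 exactly, and for $k = 2$ it should be close to 1.

```python
import random

def bilinear_power(X, u, v, k):
    """Compute u^T X^k v with k matrix-vector products (real entries)."""
    w = list(v)
    for _ in range(k):
        w = [sum(row[j] * w[j] for j in range(len(w))) for row in X]
    return sum(ui * wi for ui, wi in zip(u, w))

def normalized_second_moment(n, k, trials, seed=0):
    """Monte Carlo estimate of E|u^T X^k v|^2 / n^(k-1) for Rademacher X."""
    rng = random.Random(seed)
    u = [1.0] + [0.0] * (n - 1)   # coordinate vector e_1
    v = [0.0] * (n - 1) + [1.0]   # coordinate vector e_n
    total = 0.0
    for _ in range(trials):
        X = [[rng.choice((-1.0, 1.0)) for _ in range(n)] for _ in range(n)]
        total += bilinear_power(X, u, v, k) ** 2
    return total / trials / n ** (k - 1)
```

The estimate is random for $k \ge 2$, so only a loose comparison with 1 is meaningful at small sample sizes.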
Let

$$T := \{(a,b) : a = 1, 2,\ b = 0, 1, \ldots, k\},$$
$$T' := \{(a,b) \in T : b < k\}$$

and

$$E := \{((a,b), (a,b+1)) \in T^2 : b < k\}.$$

Let $T_P := T|_{a=1}$, $T_Q := T|_{a=2}$ and for $t = (a,b) \in T'$, set $t_s := (a, b+1)$. We will designate
the terms in the expansion of (54) by

$$\mathcal{F}' := \{F : T \to [n]\}.$$

For $F \in \mathcal{F}'$, let $F_P := F|_{T_P}$ and $F_Q := F|_{T_Q}$. Let

$$W_{u,v}(F) := |u_{F(1,0)}\, u_{F(2,0)}\, v_{F(1,k)}\, v_{F(2,k)}|$$

and

$$W_X(F) := \Big| \mathbb{E} \prod_{t \in T'} x_{F(t), F(t_s)} \Big|.$$

Then we have

$$\mathbb{E}|u^* X^k v|^2 \le \sum_{F \in \mathcal{F}'} W_{u,v}(F) W_X(F). \tag{55}$$

For $F \in \mathcal{F}'$, let

$$E_F := \{(F(t), F(t_s)) \in [n]^2 : t \in T'\}$$

denote the edges of $F$ and let

$$\mathcal{E}_F := \{\{t \in T' : (F(t), F(t_s)) = (i,j)\} : (i,j) \in E_F\}.$$

Then 

W X (F) = H E|x|' e l. 

e<=£ F 

Noting that $\mathbb{E}x = 0$ and letting

$$\mathcal{F} := \{F \in \mathcal{F}' : |e| \ge 2 \text{ for all } e \in \mathcal{E}_F\},$$

we have

$$\mathbb{E}|u^* X^k v|^2 \le \sum_{F \in \mathcal{F}} W_{u,v}(F) W_X(F). \tag{56}$$


Now, for a fixed $F \in \mathcal{F}$, let

$$V = V_F := \{F(t) : t \in T\}$$

be the set of vertices. For $v \in V$ let $m(v) = |F^{-1}(v)|$ denote its multiplicity. Let
$d_{in}(v) := |\{x \in [n] : (x,v) \in E_F\}|$ and $d_{out}(v) := |\{x : (v,x) \in E_F\}|$ denote its
indegree and outdegree. Finally, let $d(v) := d_{in}(v) + d_{out}(v)$ be the (total) degree
of $v$.


Shown in Figure 3 is an example with $k = 4$ with the paths $(1,2,3,4)$ and
$(2,3,4,1)$. Each vertex has indegree 2 and outdegree 2.


We will first determine the main term from $\mathcal{F}$ and its contribution to (56).


Lemma 10. Suppose $F \in \mathcal{F}$. Then $|V| \le k + 1$, and equality occurs only when
$F_P = F_Q$ and $|F_P| = |F_Q| = k + 1$.

Figure 3. An example of $F \in \mathcal{F}$ with $k = 4$.


Fix $v \in V_F$ and suppose $d(v) = 1$. Since each edge has multiplicity at least two,
we have the following.


(i) If $d_{in}(v) = 1$, $v = F(1,0) = F(2,0)$.

(ii) If $d_{out}(v) = 1$, $v = F(1,k) = F(2,k)$.

In particular, if two vertices of $V$ have degree 1, then one has outdegree 1, the other
has indegree 1 and the rest have both outdegree and indegree of at least 1. Since
$|e| \ge 2$ for each $e \in \mathcal{E}_F$, we also have $|E_F| \le k$. Thus

$$2k \ge 2|E_F| = \sum_{v \in V} d(v) \ge 1 + 1 + 2(|V| - 2) = 2(|V| - 1).$$

Thus, $|V| \le k + 1$ with equality occurring only when two of the vertices have degree
1 and the rest have degree 2. This proves the lemma.
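For small parameters, the statement of Lemma 10 can be confirmed by exhaustive search. The sketch below is not part of the paper's argument; the function name `check_vertex_bound` is hypothetical and vertices are labelled $0, \ldots, n-1$. It enumerates all pairs of walks with $k$ steps on $n$ vertices in which every directed edge is used at least twice in total, and records the largest number of distinct vertices together with whether every maximizer consists of two identical walks.

```python
from itertools import product
from collections import Counter

def check_vertex_bound(n, k):
    """Return (max number of vertices, True if all maximizers have F_P == F_Q)
    over pairs of k-step walks on n vertices with all edge multiplicities >= 2."""
    best, maximizers_coincide = 0, True
    for walk_p in product(range(n), repeat=k + 1):
        edges_p = list(zip(walk_p, walk_p[1:]))
        for walk_q in product(range(n), repeat=k + 1):
            counts = Counter(edges_p)
            counts.update(zip(walk_q, walk_q[1:]))
            if min(counts.values()) < 2:
                continue  # some edge is used only once, so the term vanishes
            size = len(set(walk_p) | set(walk_q))
            if size > best:
                best, maximizers_coincide = size, walk_p == walk_q
            elif size == best:
                maximizers_coincide = maximizers_coincide and walk_p == walk_q
    return best, maximizers_coincide
```

The search is exponential in $k$ and is meant only as a sanity check for values such as $n = 4$, $k \le 3$.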

We let T^main := {F £ V : \V F \ = k + 1}. We also let 

:= {F £ V : \V F \ = k, F P = F Q , F( 1, 0) = F( 2, 0) = F(l, k) = F( 2, k)}. 
Then, the contribution of V m am U to ( [56| is given by 

l u E(i,o)| 2 | u E(i,fe)| 2?7 - fe ^ 1 = n k ~ 1 . 

F(l,0)=F(2,0)e[n] 

F(l,fc)=F(2,fc)e[n] 


We partition the remainder of V in the following way. First let 
Ti := {(1,0), (2,0), (1, k), (2, k)} C T 

be the terms corresponding to the starts and ends of the paths. For t > 0 and P a 
partition Ti with |P| > 2 if t = 0, let 

Pp,t := {F £ V : \V F \ = k — t,F(s) = F(t) s ~p t, s,t £ Ti}. 

Note that we exclude the trivial partition P = {Ti} when t = 0 since P{r 1 } ! o = Fq. 
We let P 0 = Up^ {T i } Fp ,o and for t > 0, we let F t = Up Fp t t- 

Lemma 11. For $F \in \mathcal{F}_t$, $W_X(F) \ll K^{2t}$.
Since $\mathbb{E}|x|^a \ll K^{(a-4)_+}$,

$$W_X(F) \le \prod_{e \in \mathcal{E}_F} \mathbb{E}|x|^{|e|} \ll \prod_{e \in \mathcal{E}_F} K^{(|e|-4)_+}.$$

It suffices to show that $\sum_{e \in \mathcal{E}_F} (|e| - 4)_+ \le 2t$. Since at most one vertex has no
outgoing edge, $|\mathcal{E}_F| \ge k - t - 1$. Also the $|e|$'s satisfy $\sum_{e \in \mathcal{E}_F} |e| = 2k$ and $|e| \ge 2$.
If $|e| \le 4$ for all $e \in \mathcal{E}_F$, there is nothing to prove. If $|e_1| > 4$ say, then

$$\sum_{e \in \mathcal{E}_F} (|e| - 4)_+ = |e_1| - 4 + \sum_{e \ne e_1} (|e| - 4)_+ \le \sum_{e \in \mathcal{E}_F} (|e| - 2) - 2 \le 2k - 2(k - t - 1) - 2 = 2t.$$

We now turn to controlling $S_{P,t} := |\sum_{F \in \mathcal{F}_{P,t}} W_{u,v}(F)|$. To simplify notation, we
will do this for the specific case $P = \{\{(1,0)\}, \{(2,0)\}, \{(1,k), (2,k)\}\}$. We can
bound $S_{P,t}$ by

$$\sum_{\substack{i_1, i_2, i_3 \in [n] \\ \text{distinct}}} |u_{i_1} u_{i_2} v_{i_3}^2| \, |\{F \in \mathcal{F}_{P,t} : F(1,0) = i_1,\ F(2,0) = i_2,\ F(1,k) = F(2,k) = i_3\}|.$$

The cardinality of the last set is independent of the choice of indices $i_1, i_2$ and $i_3$,
and in fact only depends on the size of the partition $P$. We denote it by $N_{|P|}$. Removing
the restriction to distinct indices and using $\sum_i |u_i| = O(\sqrt{n})$, we may bound the
contribution as $n N_{|P|}$.

The case for a general partition is similar and we have the bound

$$S_{P,t} \le n^{c_P/2} N_{|P|}$$

where $c_P$ is the number of singletons in the partition $P$. To determine $N_{|P|}$, we
first choose the remaining vertices of $V_F$ in $\binom{n}{k-t-|P|}$ ways. We let $N_2 = N_2(t)$
be the maximum number of ways to choose $E_F$, over $P$ and $V_F$. Similarly, we let
$N_3 = N_3(t)$ be the maximum number of ways to choose $\mathcal{E}_F$, over $P$, $V_F$ and $E_F$.
Since

$$\binom{n}{k-t-|P|} \le \frac{n^{k-t-|P|} k^{|P|}}{(k-t)!},$$

we have

$$S_{P,t} \le \big(n^{c_P/2 - |P|} k^{|P|}\big) \frac{n^{k-t}}{(k-t)!} N_2(t) N_3(t),$$

with $|P| \ge 2$ if $t = 0$. Considering the possibilities for $P$ and setting

$$S_t := \sum_{P \text{ partition of } T_1} S_{P,t},$$

we have

$$S_t \ll \begin{cases} k^2 n^{k-3/2} N_2 N_3 / k! & : t = 0 \\ k\, n^{k-t-1} N_2 N_3 / (k-t)! & : t \ge 1 \end{cases} \tag{57}$$
We now estimate $N_2 = N_2(t)$, the number of ways to choose the set of edges $E_F$
for $F \in \mathcal{F}_t$. As observed earlier, at least $k - t - 1$ vertices have positive outdegree,
and similarly for the indegree. We need to assign at most $k$ oriented edges to the
$k - t$ vertices such that these conditions are met. Recall that $d_{out}(i)$ denotes the outdegree
of vertex $i$. We will allow for repetitions when choosing the edges in order to include graphs
with fewer than $k$ edges. Hence we may impose the constraint $\sum_{i=1}^{k-t} d_{out}(i) = k$.
For at least k — t—1 vertices, d out (i) > 1- This give^jj ways of choosing the 
outdegrees (d 0 ut(*))j=i- To assign the incoming edges of the vertices, we partition 
the $k$ edges into $k - t$ nonempty parts $(E_i)_{i=1}^{k-t}$. We first choose $k - t$ edges to belong
to the different $E_i$'s and then we choose parts for each of the remaining edges. This
can be done in at most $\binom{k}{t}(k-t)^t$ ways. Finally, we assign the $k - t$ parts to the
vertices with positive indegree. If all $k - t$ vertices have incoming edges, there are
at most $(k-t)!$ ways to assign each of them an $E_i$. Now suppose only $k - t - 1$ of
the vertices have incoming edges. First, there are at most $(k-t)^4$ ways to choose 2
vertices and 2 parts, with one vertex being assigned both parts and the other having
no incoming edges. Next, there are $(k - t - 2)!$ ways of assigning the remaining
parts to the remaining vertices. Hence


(58) 


n 2 < 

( k )(' 


\t + lj\ 

< 

fc2t+1 


(i + l)!i! ' 

< 

k 3t+3 (k - 


(.k — tY((k — t)\ + (k — t) 4 (k — t — 2 )!) 


k*{k-t)lk 2 




We now estimate $N_3 = N_3(t)$, the number of ways of choosing $\mathcal{E}_F$ once $V_F$ and $E_F$
have been chosen. Since each vertex has at least one outgoing edge, the maximum
outdegree of any vertex is at most $t + 1$. On the other hand, since $d_{out}(1) +
\ldots + d_{out}(k - t) \le k$, at least $\max(k - 2t, 0)$ vertices have $d_{out}(i) = 1$. At least
$\max(2k - 4t, 0)$ legs start from these vertices, so at most $4t$ legs begin at vertices
with $d_{out}(i) > 1$. At each of these legs, we have at most $t + 1$ choices to make when
choosing the path. We thus have

$$N_3 \le (t+1)^{4t}, \tag{59}$$

which is independent of the chosen vertices and edges.


For t = 0, using (571, (58) and 


we have 


S 0 < n k ~ 3/2 k 2 /k\N 2 (0)N 3 (0) 
< fcV'- 3 / 2 . 


Since $W_X(F) = O(1)$ for $t = 0$, the contribution to (56) is $o(n^{k-1})$.


¹ This follows from the standard stars and bars combinatorial argument; see [9].
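The stars and bars count invoked in the footnote can be checked by brute force. The sketch below is an illustration only: the function names are hypothetical, and the closed form shown is the exact count of outdegree profiles with a bounded number of zeros, not the (cruder) upper bound used in the text. With `total` playing the role of $k$, `parts` the role of $k - t$ and `max_zeros = 1`, it counts the tuples $(d_{out}(i))_i$ summing to $k$ in which at least $k - t - 1$ entries are positive.

```python
from itertools import product
from math import comb

def count_outdegree_profiles(total, parts, max_zeros):
    """Brute-force count of tuples (d_1, ..., d_parts) of nonnegative integers
    summing to `total` with at most `max_zeros` entries equal to zero."""
    return sum(
        1
        for d in product(range(total + 1), repeat=parts)
        if sum(d) == total and d.count(0) <= max_zeros
    )

def stars_and_bars(total, parts, max_zeros):
    """Closed form: choose which z parts are zero, then split `total` into
    parts - z positive integers in C(total - 1, parts - z - 1) ways."""
    return sum(
        comb(parts, z) * comb(total - 1, parts - z - 1)
        for z in range(max_zeros + 1)
        if parts - z >= 1
    )
```

For example, with $k = 6$ and $t = 2$ both functions return 50.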
For $t \ge 1$, we have


S t K 2t < n 1 *-*- 1 kN 2 N 3 K 2t / (k - t)\ 



where we have used the estimates i! > and < e. For k = o(ra e / 5 ), the last 


expression is decreasing for t < k and bounding each term by the bound for the 
t = 1 term, we have 


StK 2t <C fc V _1_e = o(n fe_1 ) 


for k = o(n e / 7 ). 


□ 


4. Proof of Theorem 4


Proof. We will work on the event 


E = E n = {p(X) < 1 + e, A > 1 + 2e for all A £ |J A®} 



denote the resolvent of X/yJn. On E, A > p(X), so we may expand R\ as a 
Neumann series 



We write the Jordan decomposition of $A$ as $A = VJU^*$, where $V$ (resp. $U^*$) is the
$n \times \mathrm{rk}(A)$ (resp. $\mathrm{rk}(A) \times n$) matrix of generalized right (resp. left) eigenvectors of
$A$ associated to nonzero eigenvalues of $A$ satisfying $U^*V = 1$, and $J$ is the Jordan
matrix of $A$ restricted to nonzero eigenvalues, of size $\mathrm{rk}(A) \times \mathrm{rk}(A)$. Starting with
the eigenvalue equation $\det(X/\sqrt{n} + A - \lambda) = 0$ and using the determinant identity
$\det(1 + AB) = \det(1 + BA)$, we have








Let $J_\theta$ be the block matrix of $J$ corresponding to eigenvalue $\theta$ and let $U_\theta$ and $V_\theta$
be the restrictions of $U^*$ and $V$ to the generalized left and right eigenvectors of $\theta$
respectively. Recall Proposition 2 as well as the notation used therein. We apply
Proposition j5j with M = Jg and P = P e = ~^Ug S\VgJg. 

First note that for each column indexed by t £ IJget = 9et, where e* is the 
coordinate vector corresponding to t. Hence for s £ J® and t £ 7®, 

Pst = A=ev*s x vt. 

\Jn 

Observe that the moment assumption made in Theorem [4] guarantees the applica¬ 
bility of Lemma [7] to the collection 

{VnPst '■ s £ t £ Iy, 9 £ 0 }. 


By LemmaFJ (y/nPg t ) Sitf g => ( F r ) rel2 defined by ([8]) , ([4]) and Q. Finally, applying 
Proposition [2] yields the procedure to determine the fluctuations as specified in 
Theorem [4] □ 


Appendix A. 


In this section we state the deterministic perturbation result referred to in the 
proof of Theorem [I] It is originally attributed to Lidskii. See [14] and references 
cited within. We remind the reader that the Schur complement of $A$ in the block
matrix

$$\begin{pmatrix} A & B \\ C & D \end{pmatrix}$$

is $D - CA^{-1}B$.


Proposition 2. Let $M$ be a $d \times d$ deterministic matrix in Jordan form. For
notational simplicity, we will assume $M$ has a single eigenvalue $\theta$. Let

$$J_k := \begin{pmatrix} \theta & 1 & & \\ & \theta & \ddots & \\ & & \ddots & 1 \\ & & & \theta \end{pmatrix}$$

denote the $k \times k$ Jordan block and write

$$M = \bigoplus_{k=1}^{K} J_k^{\oplus m_k}.$$

Hence for each $k \in [K]$, $M$ has $m_k$ Jordan blocks $J_k$. Let $P_n$ be a sequence of $d \times d$
perturbation matrices with entries of size $o(1)$. Then $M + P_n$ has spectrum

$$\Lambda(M + P_n) = \{\lambda_{k,m,i} : k \in [K],\ m \in [m_k],\ i \in [k]\}$$

with $\lambda_{k,m,i} \to \theta$ for all $k \in [K]$, $m \in [m_k]$ and $i \in [k]$. The fluctuations

$$f_{k,m,i} := \lambda_{k,m,i} - \theta$$

are given by the following procedure.
Let $c_k := \sum_{j=1}^{k} m_j$ and set $c := c_K$. Decompose $P = P_n$ into $c^2$ blocks $(B_{ij})_{i,j=1}^{c}$
with the $c$ diagonal blocks $(B_{ii})_{i=1}^{c}$ having sizes

$$1, \ldots, 1, 2, \ldots, 2, \ldots, K, \ldots, K$$

with $k$ occurring with multiplicity $m_k$. Let $k_i \times k_j$ denote the size of block $B_{ij}$. This
block decomposition is conformal with that of $M$ induced by the $J_k$'s. Let $R = R_n$
be the submatrix of $P$ of size $c \times c$ with entries given by

$$R_{ij} := (B_{ij})_{k_i, 1}.$$

Hence $R$ is formed from the lower left elements of the blocks in the decomposition
of $P$.

Let $E_k = R_{c_k \times c_k}$ be the upper left submatrices of $R$ and let $F_k$ be the $m_k \times m_k$ Schur
complement of $E_{k-1}$ in $E_k$, where we set $F_1 := E_1$. Then, to leading order, the
fluctuations $f_{k,m,i}$ are given by the $k$ $k$-th roots of the $m_k$ eigenvalues of $F_k$ for
each $k \in [K]$. If $M$ has multiple eigenvalues, we apply the above procedure to each
eigenvalue separately.


We remark on a few special cases of Proposition 2. We denote the entries of
$P = P_n$ by $p_{ij}$ and assume $p_{ij} = O(n^{-1/2})$ (as will turn out to be the case in our
applications).

(1) Suppose $M = \mathrm{diag}(\theta_1, \ldots, \theta_d)$ is diagonal with distinct eigenvalues. Let $\lambda_j$
denote the corresponding eigenvalues of $M + P$ in the sense that $\lambda_j \to \theta_j$
as $n \to \infty$. Then

$$f_j := \lambda_j - \theta_j = p_{jj}(1 + o(1)).$$

(2) Suppose $M = \theta I_d$. Then $\{\sqrt{n}(\lambda_j - \theta)\}_{j=1}^{d}$ converge to the $d$ eigenvalues of
$\sqrt{n} P$.

(3) Suppose $M = J_d(\theta)$. Then $\{n^{1/(2d)}(\lambda_j - \theta)\}_{j=1}^{d}$ converge to the $d$ $d$-th roots of
$\sqrt{n}\, p_{d1}$.
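Special case (3) can be made concrete for $d = 2$, where the eigenvalues of the perturbed Jordan block are available in closed form. The sketch below is a worked example under that assumption and is not taken from the paper; the function names are hypothetical. It returns the exact deviations $\lambda - \theta$ for the matrix with rows $(\theta + p_{11},\ 1 + p_{12})$ and $(p_{21},\ \theta + p_{22})$, and compares them with the leading-order prediction $\pm\sqrt{p_{21}}$ coming from the lower left entry.

```python
import cmath

def jordan2_fluctuations(p11, p12, p21, p22):
    """Exact values of lambda - theta for the eigenvalues lambda of
    [[theta + p11, 1 + p12], [p21, theta + p22]] (quadratic formula).
    The result does not depend on theta."""
    half_trace = (p11 + p22) / 2
    disc = ((p11 - p22) / 2) ** 2 + (1 + p12) * p21
    root = cmath.sqrt(disc)
    return half_trace + root, half_trace - root

def leading_order(p21):
    """Leading-order prediction: the two square roots of the lower left entry."""
    r = cmath.sqrt(p21)
    return r, -r
```

When the entries of the perturbation are of size $\epsilon$, the two outputs agree up to an error of order $\epsilon$, which is much smaller than the fluctuation itself, of order $\sqrt{\epsilon}$.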


Appendix B. 


In this appendix, we extend the results involving the moment method, namely
Proposition 1 and Lemma 7, using a truncation argument (see [1]). Consider the
following two assumptions on the atom distribution $x$.


(i) $|x| \le K = O(n^M)$.

(ii) $\mathbb{E}|x|^m < \infty$, $m = 2/M$.


We show that if Proposition 1 and Lemma 9 hold for (i) with $M < 1/2$, then they
hold for (ii).


Suppose we have (ii) with $m > 4$, corresponding to $M = 2/m < 1/2$. We first
show that the event

$$\{|x_{ij}| \le n^M \text{ for all } i,j \in [n]\}$$

occurs w.h.p. Indeed, we have

$$\mathbb{P}\big[|x_{ij}| > n^M \text{ for some } i,j \in [n]\big] \le n^2\, \mathbb{P}\big[|x|^m > n^2\big]. \tag{60}$$

Since $n^2 \mathbf{1}_{|x|^m > n^2} \le |x|^m$ and $\mathbb{E}|x|^m < \infty$, the last expression converges to 0 by the
dominated convergence theorem.


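Now define the truncated random variables $\hat{x} := x \mathbf{1}_{|x| \le n^M}$ and $\hat{X} = (\hat{X})_{ij}$ by
$\hat{X}_{ij} := \hat{x}_{ij}$. While $\hat{x}$ is bounded, it no longer has mean zero. On the other hand,
for $n$ sufficiently large, we have

$$|\mathbb{E}\hat{x}| \le \mathbb{E}|x| \mathbf{1}_{|x| > n^M} \le n^{-(m-1)M}\, \mathbb{E}|x|^m \mathbf{1}_{|x| > n^M} \le n^{-3/2}. \tag{61}$$

By Schur's test for the operator norm of a matrix, we have

$$\|\mathbb{E}\hat{X}\| = O(n^{-1/2}). \tag{62}$$

Now let $\tilde{x} := \hat{x} - \mathbb{E}\hat{x}$ and $\tilde{X} := \hat{X} - \mathbb{E}\hat{X}$ denote the truncated and centered random
variables. By construction, $\mathbb{E}\tilde{x} = 0$. Furthermore,

$$\mathbb{E}|\tilde{x}|^2 = \mathbb{E}|\hat{x}|^2 - |\mathbb{E}\hat{x}|^2 \to \mathbb{E}|x|^2 = 1 \tag{63}$$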


by (61) and dominated convergence. Given this, Proposition 1 is valid for $\tilde{X}$.
Since $\mathbb{E}|\tilde{x}|^2 \le \mathbb{E}|\hat{x}|^2$, it is easy to check that under (i)
Lemma 7 is also valid for $\tilde{x}$. To
prove the validity of Proposition 1 and Lemma 7 for $x$ under
(ii), it suffices to prove
the following.


Lemma 12. Suppose $u = u_n$ and $v = v_n$ are unit vectors in $\mathbb{C}^n$. Then for every
$\gamma > 0$, the event

$$A_{n,\gamma} := \bigcup_{k \le \log^2 n} \big\{ |u^* \hat{X}^k v - u^* \tilde{X}^k v| > \gamma n^{(k-1)/2} \big\}$$

occurs w.h.p.
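The truncation and centering steps used in this appendix, and the variance identity in (63), can be verified exactly on a small discrete atom distribution. The sketch below is illustrative only: the two-point distribution in the usage note, the truncation level and the function names are assumptions made for the example, and exact rational arithmetic is used so that no rounding enters.

```python
from fractions import Fraction

def truncate_and_center(values, probs, bound):
    """For a finite real distribution, return the mean of the truncated
    variable, the truncated values, and the truncated-and-centered values."""
    truncated = [x if abs(x) <= bound else Fraction(0) for x in values]
    mean_hat = sum(p * x for p, x in zip(probs, truncated))
    centered = [x - mean_hat for x in truncated]
    return mean_hat, truncated, centered

def moment(values, probs, power):
    """Moment of the given order of a finite real distribution."""
    return sum(p * x ** power for p, x in zip(probs, values))
```

For the distribution taking the value $-1/2$ with probability $4/5$ and $2$ with probability $1/5$ (mean 0, variance 1), truncating at level 1 gives a truncated mean of $-2/5$, and the centered variable has mean 0 and second moment $1/5 - 4/25 = 1/25$.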


We first state a result that is a consequence of the proof in [2]. Following the
notation of [2], we define $\delta := n^{M-1/2}$ so that the truncated variable satisfies $|\hat{x}| \le \delta\sqrt{n}$. Fix $z > k + 1$ and $p$ a
positive integer. Then



< z~ 2p n~ pk ETr [x k (A fc )*) P 
=: z~ 2p n~ pk E n . 


In 0(pg. 561), it is shown that 

pk 


E n < n‘ 


kp+-. 


E 

1=1 


2 kp 
21 


(k + 1 ) 


2kp— 2Z+2p 


(2 kp) 


loe- s ^ 

10 & (2 kp) 3 


6kp—6l 


fikp-l 
In our application, k < log 2 n and choosing p = S 1 ^ 7 say, we have

6 kpS 1 / 6 


(64) 


losr S ^ 

10 0 (2 kp) 3 


-> 0 . 


In fact, the left-hand side of ( |64| is less than 1 for n > N(m). 
For such n, following [2](pg. 562), it then follows that 


z~ 2p n~ pk E n < ^(2kpn 2 ) 1/p (l + (jfe + l)S 1/2 ) 2k ^ 


k + 1 


2 \ P 


Choosing $z = 3k$ say, for any $k \le \log^2 n$ we have

$$\mathbb{P}\left[ \left\| \left( \frac{1}{\sqrt{n}} \tilde{X} \right)^k \right\| > 3k \right] = O(e^{-n^c}) \tag{65}$$

for some $c = c(m) > 0$. We now turn to the proof of Lemma 12.


Proof. By (|65|), we may assume 


(**)' 


w.h.p. We will need the crude bound 


< 3k for all k < log 2 n which occurs 


( 66 ) 

We then have 
1 


E Y[ai<n 2k . 


ai + ...+flfc=n i —1 
a*> 0 


(fc-l)/2 


U 


n 


X k v - u*X k v\ < n- (fe - 1)/2 ||(X + EX) fe - X k 


k i Z+l 


l—l l , =Oai-\-...-\-a l /=k—li=l 

l 1 +1 

sE^E(“ 2 )'' 


- 1 /! 
k= 1 V =0 


<r l + 4 f 3fc 2 \ + _ 


where we have used (66) and (621 in the third line. 


\fn 


||EX||* 


□ 